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Science and satire 


The terrorist attacks in Paris were an assault on the fundamental values of free and democratic 
societies. Researchers, and humorists, must combat obscurantism everywhere. 


to be executed for blasphemy; his list of ‘crimes’ included a refusal 

to remove his hat as a religious procession passed by. Eighteenth- 
century France was one of the first nations to push back against the 
tyranny of religious authority that stifled free thought across Europe at 
the time — and continues to do so in other places. That proud legacy 
— and the part that both science and satire played in promoting the 
contrasting values of the Enlightenment — is worth reflecting on as 
millions across the world struggle to make sense of the horrific events 
in Paris last week. 

The terrorists who murdered 17 people, including 8 staff members 
of the French weekly satirical magazine Charlie Hebdo, falsely claimed 
to act in the name of Islam. On the contrary, the perpetrators represent 
a fanaticism that would stifle freedoms and science in the Arab and 
Islamic world, and beyond. The means used in the eighteenth century 
remain among the best options to combat this warped world view today. 
Free scientific thinking and satire — both religious and political — are 
crucial in challenging and undermining dogma and authoritarianism. 

It is no coincidence that the eighteenth-century French writer, 
intellectual and activist Voltaire, a leading Enlightenment figure, was 
both an outspoken and irreverent satirist of religion and a leading 
proponent of the natural sciences as the successor to religion and philo- 
sophical reasoning as the main route to knowledge. The Enlightenment 
culminated in the French Revolution, and the resulting 1789 Declara- 
tion of the Rights of Man and of the Citizen, which explicitly protected 
free speech, including the right to freely criticize religious views. 

Philippe Val, a former editor of Charlie Hebdo, has compared satire 
to science, saying that both are methods of pursuing ‘truth’ in the face 
of dogma. Charlie Hebdo is no Voltaire, but Val has a point. Satire, wit 
and mockery remain surprisingly effective ways to voice dissent, and to 
highlight the absurdities, hypocrisy, injustice and oppression of authori- 
tarian regimes and religious obscurantism. 


I: 1766, Jean-Francois Lefebvre became the last person in France 


FREEDOM OF THOUGHT 

Satire is widely credited with helping to undermine the authority of 
regimes during the revolutions of the Arab Spring, starting in 2011. It 
continues to be a powerful means to air grievances and call for further 
freedoms not only in Tunisia, where the revolution has prompted a 
relatively successful transition to democracy, but also in other Arab 
countries where uprisings have so far ended in chaos. 

Although satire may survive, in the Arab world the development of 
science has long been hampered. The landmark Arab Human Develop- 
ment Report 2003, written by Arab scholars and sponsored by the 
United Nations Development Programme, identified authoritarianism, 
and the resulting lack of open democratic societies that uphold freedom 
of speech, as the main obstacle. Greater freedoms, and science itself, it 
argued, work together to build free and creative societies. 

The Arab spring raised hope among scientists in the region that 


greater freedoms and democracy would usher in societies based on 
science and other forms of knowledge. The same fanaticism that 
brought bloodshed to the streets of Paris jeopardizes those hopes, 
and must be resisted. 

The Paris killings are also a reminder that the hard-won freedoms that 
most readers of Nature enjoy must not be taken for granted, but must be 
continually defended. In many countries, it is taken as given that one is 
free to express opinions, criticize government policy and research any 
subject without worrying about intimidation, retaliation or imprison- 
ment, much less being shot in the head. 


“Freedoms must Those rights are increasingly being eroded, 
not be taken however — ironically, often in political over- 
for granted, reactions to terrorism — including through 
but must be anti-terrorism laws that roll back civil liberties, 
continually increasingly invasive surveillance states and 


defended.” government oppression of legitimate dissent 
such as the Occupy movement. Politics and 
other forces can sometimes lead to censorship of lines of enquiry — such 
as gun research in the United States — as well as more insidious forms 
of influence on areas of research. 

Scientists and satirists everywhere must remain vigilant to protect 
liberties, and to fight obscurantism in any form. Social science and other 
research is needed to better understand the origins of violent fanati- 
cism, conflict and intercommunal strife. Tackling terrorism is about 
much more than repressive measures. It demands long-term political 
and social initiatives, and policies to help to address the root causes. 

The heritage of Voltaire and the Enlightenment explains why the 
French people have reacted much more strongly to the latest attacks 
than to the many acts of terrorism they have endured in the past. The 
terrorists attacked a symbol of the right to free expression. Free speech 
does have its limits, and many countries rightly impose laws that, for 
example, outlaw the incitement of religious or racial hatred. But the 
right to criticize, and even to mock, religion, fanaticism, superstition 
and indeed science is not only rightly protected by law in France, but is 
enshrined there, as in many countries, as a fundamental human right. 

The killing also attacked other symbols of the Republic, including 
a kosher supermarket — one symbol ofa multicultural society — and 
the police. One of the officers killed, Ahmed Merabet, was Muslim, 
and his brother aptly remarked that the killers were “false Muslims” 
and that Merabet had been “proud to represent the French police and 
to defend the values of the Republic — liberty, equality, fraternity”. 

Marches for national unity on Sunday brought some 4 million peo- 
ple onto the country’s streets, the largest turnout in French history; the 
crowds in Paris surpassed those that welcomed allied troops liberation 
of the city during the Second World War. Fundamentalists throughout 
history have sought to subjugate freedoms, including freedom of expres- 
sion and thought. More than 200 years ago, France rejected them with a 
“Non!” that echoed across the world. It has done so again. = 
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Deep mysteries 


Arguments among ocean scientists show how 
much remains to be discovered. 


the abyss. The ocean is our ultimate ancestral home and a constant 
and alluring source of opportunity, danger and mystery. 

Certainly, those scientists who spend their careers peering into the 
seven seas are quick to dredge up the idea that we know more about 
celestial bodies than we do about the oceans. Who can blame them 
when outer space grabs the attention? Witness the interest in the Rosetta 
comet mission or last week’s announcement of more exoplanets — now 
counted in the hundreds — which received wall-to-wall press coverage. 

News about the oceans tends to be bad, and feeds the dark imagery — 
such as Malaysia Airlines flight MH370, which is believed to have gone 
missing over the Indian Ocean with huge loss of life, or the enormous 
car-carrying ship that ran aground a fortnight ago off the UK coast. 

Some scientists are now questioning whether this drip feed of 
negativity about the oceans is reflected in the attitudes of researchers. 
Carlos Duarte of the University of Western Australia in Crawley and 
his colleagues claim in a provocative paper that ocean science focuses 
too much on the narrative of man-made disaster (C. M. Duarte et al. 
BioScience http://dx.doi.org/10.1093/biosci/biul 98; 2014). Their study 
argues that “doom and gloom media accounts” of the terrible state of 
the oceans are frequently not based on strong evidence. 

Some marine situations painted as global calamities are well 
supported by science, they acknowledge, such as the depletion of fish 
stocks. But others are far from certain; for example, global explosions 
in jellyfish numbers and the predicted impact of ocean acidification on 
organisms such as corals. The media can be prone to exaggeration, but 
marine researchers, too, “may not have remained sufficiently skeptical’, 
say Duarte and his co-authors. Some ‘calamities’ are now accepted by 
researchers, and repeated as truth owing to problems with citations and 
observation bias, and to the desire to help solve these problems. This is 


I: is the ultimate metaphor for the unknown: the deep; the depths; 


a contentious thesis, and other researchers dispute it. As usual in such 

spats, both sides argue that more work is needed. It is hard to disagree. 
Some progress on one high-profile marine problem — the fate of 
coral reefs — is published on Nature’s website this week. The study 
uses years of data from the Seychelles to unpick the factors that allowed 
coral reefs there to bounce back and flourish after a mass ‘bleaching’ 
event in 1998 (N. A.J. Graham et al. Nature http://dx.doi.org/10.1038/ 
naturel14140; 2015). Some 90% of the corals 


“To neglect the died and turned the reefs pale. 

oceans because Around half of the reefs recovered from 
it is cheaper the bleaching, but more worryingly, whether 
to get good they were in a marine protected area seemed 
results onlandis to make no difference to this recovery. Stud- 


foolish.” 


ies such as this will be increasingly important 
as climate change forces alterations in many 
marine systems. Even in well-studied ecosystems such as reefs, the 
effects of climate change and other human activity are unclear, and 
getting answers from such complex systems requires huge amounts of 
data. The Seychelles analysis used 17 years of data to come to its con- 
clusions — data that were being collected when European scientists 
were still discussing parts of the Rosetta mission. 

Some studies need even more data. Also published on Nature’s web- 
site this week is a paper on sea-level rise that looks at tide data going 
back to the 1900s (C.C. Hay et al. Nature http://dx.doi.org/10.1038/ 
nature14093; 2015). Such data have previously been used to estimate a 
sea-level rise of between 1.6 and 1.9 millimetres a year over the twenti- 
eth century. By contrast, Hay et al. find a lower rate of 1.2 mm per year. 
But their estimates for more-recent changes are 3mm a year between 
1993 and 2010 — in line with previous estimates. This implies that the 
recent increase in sea-level rise may be bigger than previously thought: 
a small but rather important difference. 

Information on sea levels from some sites dates back to the eight- 
eenth century. Yet the real value of this information is only now emerg- 
ing. Ocean data are expensive to collect. Ships are costly to build, 
equally so to run. But to neglect the oceans because it is cheaper to get 
good results on land is foolish. The proverbial drunk searches for his 
keys under a streetlight because that is the only place that he can see. 
The sea may be mysterious, but some mysteries can be solved. m 


Out of the bag 


The preference for either cats or dogs affects 
science more than you might think. 


neatly summarized by the late British journalist Christopher 

Hitchens. “Owners of dogs will have noticed that, if you provide 
them with food and water and shelter and affection, they will think you 
are god,” Hitchens observed. “Whereas owners of cats are compelled to 
realize that, if you provide them with food and water and shelter and 
affection, they draw the conclusion that they are god.” 

Many see the apparent conflict between the attitudes of the two 
animals mirrored in the personalities of those who choose to own 
one or the other. Cat owners, it is claimed, are more neurotic and open 
with their emotions. Dog people are more disciplined and outgoing 
(and not just for long, muddy walks). Science has little conclusive to 
say on the matter, but accident statistics do: a 2010 study of non-fatal 
injuries in the United States found that more than seven times as many 
people were likely to hurt themselves in falls caused by dogs as by cats. 
(Cat owners are allowed a feeling of smug superiority here, because 
the actions of dog owners were as much to blame for the accidents in 
many cases as the animals themselves.) 


Tos much-discussed difference between pet cats and dogs was 
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As we discuss in a News story on page 252, the differences between 
dog folks and cat people extend all the way up to senior scientists, 
including some at the US National Institutes of Health, at least accord- 
ing to feline researchers. Work on cats has been overlooked for years, 
they complain, partly because “there were more powerful people 
interested in dogs”. The complete dog genome was sequenced a dec- 
ade ago, and has produced hundreds of genes linked to canine traits 
and diseases, but a high-quality version of the first cat genome was 
published only last year. 

Now, the cat lobby is trying to catch up, and feline fanciers every- 
where have their chance to help. Just US$7,500 will pay for a single 
cat’s genome to be sequenced, and the funders — pet owners, breed- 
ers, pet-food companies — get to choose the breed or even the indi- 
vidual animal. Together, the project organizers hope that comparisons 
between dozens of these separate genomes will shed more light on cat 
diseases and genetic mutations that may drive similar conditions in 
humans — just as they have already for dogs. 

Indeed, as scientists know, cats and dogs have more in common 
than it might seem. Cats and dogs do not even have to fight like, well, 
cat and dog. Plenty of people own both animals, and research on these 
households offers some advice: get the cat first, and get both while 
they are young. 

Dont fancy either? Then take inspiration 
from Winston Churchill. “I am fond of pigs,’ he 
said. “Dogs look up to us. Cats look down on us. 
Pigs treat us as equals.” m 
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SIMONA BEDNAREK 


W R L D V EW A personal take on events 


industry — bibliometrics — that would have us believe that 
it is easy: count journal articles, sort them according to the 
impact factors of the journals, and count all the citations. 

Science managers and politicians seem especially fond of such ways 
to assess ‘scientific quality. But many scientists also accept them, and 
use them in hiring and funding decisions. They are drawn to the 
alleged objectivity of bibliometrics. Indeed, one sometimes hears that 
scientists should be especially ready to apply scientific methods to 
their own output. However, scientists will also be aware that no good 
science can be built on bad data, and we are in a unique position to 
judge the quality of the raw data of bibliometrics, because we generate 
them through our citation behaviour. 

The underlying assumption of bibliometrics is 
that, by citing, scientists are engaging in an ongo- 
ing poll to elect the best-quality academic papers. 
But we know the real reasons that we cite. Chiefly, 
it is to refer to results from other people, our own 
earlier work or a method; to give credit to partial 
results towards the same goal; to back up some 
terminology; to provide background reading for 
less familiar ideas; and sometimes to criticize. 

There are less honourable reasons, too: to boost 
a friend’s citation statistics; to satisfy a potential 
big-shot referee; and to give the impression that 
there is a community interested in the topic by 
stuffing the introduction with irrelevant citations 
to everybody, often recycled from earlier papers. 

None of these citations — for good reasons 
or bad — express the opinion that the paper in 
question is a remarkable scientific achievement. 

Consequently, highly cited papers often contain popular (but other- 
wise unimpressive) concepts or methods. If you have a favourite well- 
cited paper, it is a sobering experience to check 20 random citations. 
They typically contain little appreciation for the quality of the work. 

To be sure, selection for an academic job guided mainly by cita- 
tion statistics or papers in high-impact journals will get better results 
than flipping a coin. But it is blind to the difference between someone 
who creatively develops a research agenda — and is likely to be doing 
that in ten years — and someone who grinds out papers in a narrow, 
fashionable subfield. 

Many negative effects of bibliometrics come not from using it, but 
from the anticipation that it will be used. When we believe that we 
will be judged by silly criteria, we will adapt and behave in silly ways. 

A good example is the distortion in the jour- 


| | ow do we recognize a good scientist? There is an entire 


nal landscape — and with it the changes in the DNATURE.COM 
style of papers — that arose when the journal _ Discuss this article 
impact factor began to be taken seriously as a __onlineat: 


proxy for reputation. go.nature.com/q3ede3 


WHEN WE BELIEVE 
THAT WE WILL BE 
JUDGED BY 


SILLY 
CRITERIA 


WE WILL ADAPT AND 
BEHAVE IN 


SILLY WAYS. 


The focus on bibliometrics 
makes papers less useful 


Forcing research to fit the mould of high-impact journals weakens it. Hiring 
decisions should be based on merit, not impact factor, says Reinhard Werner. 


For example, when Physical Review Letters (PRL) split from Physi- 
cal Review, it was intended to allow speedier publication of short 
announcements, which had previously been sent as unrefereed let- 
ters to the editor of Physical Review. 

It is easier to reach high impact with this format, so the ‘reputa- 
tion’ shifted from the standard journal to the letters section. Although 
there is no reason that shorter papers should be scientifically better 
than long ones, many authors now happily mutilate their work to stay 
under PRL’s page limit, rendering papers less readable and less useful. 

Another example is the way Nature became the top journal for 
experimental physicists. Life scientists are more numerous and use 
more citations than physicists, so the impact factors of Science and 
Nature, which cover all sciences, easily beat that 
of any non-review physics journal. Despite the 
higher impact factor, there is no reason why a 
paper written for a broad audience should be 
scientifically more valuable than one with an in- 
depth technical discussion. In fact, in pitching 
for such an audience, authors often leave out the 
tricky parts, keep technical terms out of their titles, 
and overstate their conclusions in broad terms. 

What can we do? Simply, individual scientists 
must resist the trend of making bibliometrics a 
central plank in their decision-making processes. 
And we must make this public, perhaps by stat- 
ing in job adverts that papers will be judged by 
scientific merit and not by journal impact factor. 

Once a hiring decision is made, we should 
resist the temptation to justify it by quoting the 
candidate's bibliometrics to administrators. This 
reinforces the damaging idea that hiring decisions 
could be made by administrators in the first place, and makes it harder 
to justify decisions that do not follow the metrics the next time round. 

As the tyranny of bibliometrics tightens its grip, it is having a dis- 
astrous effect on the model of science presented to young research- 
ers. For example, a master’s student of mine moved to a renowned 
research institute for his PhD. Like many institutes, this one boasts of 
its performance in terms of publications in high-impact journals. So 
my student was told: “If you cannot write up your research in a form 
suitable for Nature or Science or Physical Review Letters, don't bother 
to even do it” Such advice, driven by the appeal of metrics to funders, 
is common but horribly misguided. 

If we raise scientists to be driven by such extrinsic motivation alone, 
then why should they not follow the logic to its natural conclusion, 
and run off to become well-paid bankers instead? m= 


Reinhard Werner is professor of theoretical physics at Leibniz 
University in Hanover, Germany. 
e-mail: reinhard.werner@itp.uni-hannover.de 
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RESEARGH HIGHLIGHTS scien. 


MICROBIOLOGY 


Gut microbes’ 
survival tactics 


Gut bacteria protect 
themselves from host 
inflammation by modifying 
their outer membranes. 

Immune responses designed. 
to wipe out infection could, 
in theory, also perturb 
helpful flora that reside in the 
gut. To find out how these 
microbes resist the effects 
ofinflammation, Andrew 
Goodman of Yale University 
in New Haven, Connecticut, 
and his colleagues studied 
17 bacterial species that 
normally live in the human 
gut. They found that the 
microbes were all resistant to 
antimicrobial peptides released 
by hosts to kill pathogens. 

In the bacterium Bacteroides 
thetaiotaomicron, this resilience 
was linked to expression of 
a protein called LpxE, which 
neutralized the negative 
charge of the cell membrane, 
preventing the positively 
charged peptides from binding 
to the gut microbe’s surface. 
Mutants that did not express 
LpxF were outcompeted in 
mouse guts by other microbes 
during inflammation. 

Science 347, 170-175 (2015) 


MATERIALS 


Arsenic formsa 
semiconductor 


Single-atom-thick layers of 
arsenic and antimony could 
be efficient semiconductors 
that have more applications 
than other two-dimensional 
materials. 

Atom-thick materials 
can have unique electronic 
and optical properties, but 
some operate only at certain 
wavelengths of light, owing 
to small ‘band gaps: On the 
basis of quantum mechanical 
calculations, Zhongfang 


Monkey in the mirror 


Macaques can be trained to recognize themselves in a mirror, 
the first such observation in any monkey species. 

Most animals encountering their reflections act as if they 
are seeing another creature. To find out whether monkeys 
can be trained to recognize their own reflections, Neng Gong 
and his team at the Shanghai Institutes for Biological Sciences 
placed rhesus macaques in front of a mirror and shone a 
low-powered laser beam on their faces to produce a mild heat 
irritation. They rewarded animals when they touched the 
irritating spot on their face (pictured). 

After 12 to 38 days of this regimen, 5 out of 7 macaques 
using the mirror touched an odourless mark applied to their 
faces. With a mirror in their cages, some of these monkeys 
seemed to use it to explore parts of their bodies that they 
could not otherwise see. It is not clear, however, whether these 
behaviours mean that the monkeys have higher cognitive 
abilities such as self-awareness, the authors say. 

Curr. Biol. http://doi.org/x54 (2015) 


Chen at the University of 
Puerto Rico in San Juan, 
Haibo Zeng at the Nanjing 
University of Science and 
Technology in China and their 
colleagues predict that arsenic 
and antimony can switch 
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from being semi-metallic in 
bulk to semiconducting as 

a single-atom layer. These 
materials, called arsenene 
and antimonene, have wider 
band gaps than other two- 
dimensional semiconductors, 


meaning that they could be 
used in short-wavelength 
optoelectronic devices such 
as blue or ultraviolet light- 
emitting diodes. 

The authors say that such 
materials could soon be 
synthesized in the lab. 
Angew. Chem. Int. Edn 
http://doi.org/f2x29z (2015) 


EVOLUTION 


Lungs began with 
many chambers 


The lungs of ancestral, land- 
based vertebrates may have 
had multiple chambers rather 
than just one, as was believed. 
Markus Lambertz at the 
University of Bonn in Germany 
and his colleagues studied 
lung samples from 73 species 
ofamniotes, which include 
mammals, birds and reptiles. 
They also looked at lung 
development in embryos ofa 
gecko, Paroedura picta, which 
has single-chambered lungs. 
They found that all multi- 
chambered lungs shared key 
anatomical features, such as 
branching of the arteries. These 
features were present even in 
the single-chambered lungs of 
lizards and snakes, and in the 
embryonic gecko lung. 
Ancestral amniotes evolved 
multi-chambered lungs as 
they shifted to life on land, the 
authors say. Some, however, 
may have later developed 
single-chambered lungs as they 
evolved into smaller creatures 
to maximize air space in the 
lungs, the team adds. 
Biol. Lett. 11, 20140848 (2015) 


SUSTAINABILITY 


Resource use 
peaks worldwide 


The rates at which humans 
consume multiple resources 
such as food and wood peaked 
at roughly the same time, 
around 2006. This means 


NENG GONG AND COLLEAGUES/CURR. BIOL. 


J. RODGERS, UNIV. ILLINOIS 


RS PEABODY MUS. PHOTO BY DONALD E. HULBERT 


that resources could be 
simultaneously depleted, so 
achieving sustainability might 
be more challenging than 

was thought. 

Ralf Seppelt of the Helmholtz 
Centre for Environmental 
Research in Leipzig, Germany, 
and his colleagues estimated 
the peak rate of extraction for 
27 resources. For 20 of them, 
mostly renewables such as meat 
and rice, the peak-rate years 
occurred between 1960 and 
2010, with many clustering 
around 2006. Only coal, gas, 
oil, phosphate, farmed fish 
and renewable energy have 
yet to peak. 

Humans use multiple 
resources to generate new 
ones and to meet basic needs, 
which could explain the 
synchronicity of peak usage, 
the authors suggest. 

Ecol. Soc. 19,50 (2014) 


MATERIALS 


Silicon buckles to 
form 3D shapes 


Researchers have created 

a variety of small, three- 
dimensional structures by 
buckling strips of silicon and 
other materials. 

Turning advanced two- 
dimensional materials into 
three-dimensional shapes 
has proved difficult. John 
Rogers at the University of 
Illinois at Urbana-Champaign 
and his colleagues added 
hydroxyl groups at specific 
locations along the length of 
silicon ribbons as narrow as 
800 nanometres and as thin as 
100 nm. They also added these 
groups in specific patterns to 
an elastic, stretched substrate, 
and allowed the silicon strips to 
bond to the substrate. 


When the substrate 
returned to its original shape, 
the silicon buckled to forma 
range of structures less than a 
millimetre wide (pictured), 
including helices, boxes and 
flowers. These could then 
be assembled into larger 
configurations. 

The method uses a variety of 
materials, such as metals and 
polymers, and has the potential 
to make structures for a wide 
range of electronic devices, the 
authors say. 

Science 347, 154-159 (2015) 


Mosquitoes gain 
resistance 


A malaria-carrying mosquito 
inherited insecticide- 
resistance genes from a related 
species, around the time 

that bed nets treated with 
insecticide were increasingly 
used in West Africa. 

Gregory Lanzaro at the 
University of California, Davis, 
and his colleagues analysed 
DNA from more than 1,000 
specimens of Anopheles 
coluzzii and Anopheles 
gambiae in Mali from 2002 
to 2012. They found that a 
group of genes, including 
one for insecticide resistance, 
from A. gambiae moved into 
A. coluzzii around 2006 when 
the two species mated. 

Campaigns to encourage 
the use of insecticide-treated 
bed nets began in 2005 in this 
region, and the authors suggest 
that the nets favoured the 
selection of hybrid, insecticide- 
resistant mosquitoes. 

Proc. Natl Acad. Sci. USA 
http://doi.org/x56 (2015) 


PHOTONICS 


Few photons make 
‘ghost image’ 


Physicists have captured an 
image of a wasp’s wing using 
less than one photon per pixel. 
Peter Morris and his 
colleagues at the University 
of Glasgow, UK, useda 
technique called ghost 
imaging, which uses pairs 
of photons whose positions 


RESEARCH HIGHLIGHTS MiiiSaiaa¢ 


SOCIAL SELECTION 


Popular articles 
on social media 


Unveiling secret funding decisions 


Many scientists struggle to understand why some grant 
applications succeed and others fail, perhaps explaining 

the online popularity of an article calling for increased 
transparency in the grant peer-review process. Writing in 
PLoS Biology, Daniel Mietchen, an evolutionary biologist 

at Berlin’s Museum of Natural History, argues that, among 
other things, all successful proposals and their reviews 
should be released to the public. “Open science is now old 
hat; here’s a call for open research funding processes,’ tweeted 
Dorothy Bishop, a neuropsychologist at the University of 
Oxford, UK. But some see a downside to transparency. James 
Johnson, a diabetes researcher at the University of British 
Columbia in Vancouver, Canada, tweeted: “I try to be open, 
but I’ve also had desperate people steal my ideas.” 


PLoS Biol. 12, e1002027 (2014) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


are inextricably correlated, 

or entangled. One of each 
photon pair was either 
transmitted or absorbed by 
the wing, while its twin was 
used to reconstruct the image. 

Each photon was collected 
at a separate detector. To 
avoid recording stray photons 
that cause noise, a camera 
captured the ‘image-creating’ 
photon only when its partner 
from the wing was detected. 
Applying image-compression 
techniques, the authors 
further reduced the number 
needed to just 0.45 photons 
per pixel. 

Such techniques could be 
useful in biological imaging 
when high levels of light could 
damage the sample, they say. 
Nature Commun. 6, 5913 (2015) 


Maize’s journey 
out of Mexico 


DNA from the cobs of ancient 
maize (corn) shows how the 
crop was taken to the US 
southwest from Mexico. 
Maize was domesticated 
from the wild grass teosinte, 
an inedible weed, more than 


> NATURE.COM 
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6,000 years ago in 
southern Mexico, 
and later spread 
throughout the 
Americas. A 
team led by 
Rute da Fonseca 
and Thomas 
Gilbert at the 
University of 
Copenhagen 
analysed 
nuclear DNA 
from 32 maize 
samples from several 
archaeological sites spanning 
Mexico and the US southwest 
(pictured is a 5,000-year-old 
specimen). 

They conclude that maize 
arrived in the US southwest 
around 4,000 years ago along 
a highland route in central 
Mexico — not by a Pacific 
coastal route as other studies 
had suggested. Along the 
way, maize evolved into a 
sweeter and more drought- 
tolerant crop. 

Nature Plants http://doi.org/x6p 
(2015) 
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Netting neutrinos 


Officials in China broke 
ground at the Jiangmen 
Underground Neutrino 
Observatory (JUNO) in Jinji 
Town on 10 January. The 
underground experiment, an 
international collaboration 
designed to succeed China’s 
Daya Bay Reactor Neutrino 
Experiment, is scheduled to 
begin taking data in 2020. 
JUNO aims to shed light on 
the relative masses of the three 
known types of neutrino by 
studying particles emitted by 
nearby nuclear reactors or 
coming from space. Earlier 
last week, India’s government 
approved the India-based 
Neutrino Observatory 
(INO), at an estimated cost of 
US$240 million. 


Pipeline gets closer 
On 9 January, the Nebraska 
Supreme Court dismissed 

a lawsuit that had sought to 
block the planned route of 

a controversial oil pipeline. 
The Keystone XL pipeline is 
intended to transport up to 
830,000 barrels of crude oil 
per day from the tar sands 

of western Canada to the 

US Midwest, where it would 
connect with existing pipelines 
to refineries along the Gulf 

of Mexico. US President 
Barack Obama has not said 
yet whether he will approve 
the project, but on 7 January 
his administration said that 

he plans to veto legislation 
introduced by Republicans 

in Congress that would 
circumvent an ongoing review 
and fast-track construction. 


Nuclear disposal 


The private consortium 
charged with cleaning 

up a huge nuclear-waste 
repository in Sellafield, UK, 
has lost its contract, the UK 


The news in brief 


Divers find ossuary in Madagascan cave 


A giant trove of bones of extinct lemurs and 
other animals has been found in an underwater 
cave in Madagascar’s Tsimanampesotse 
National Park, researchers announced last 
week. The specimens recovered so far include 
the 30-centimetre-long head of a massive ‘koala 
lemur’ (Megaladapis) and dozens of giant 
lemurs (Pachylemur insignis), but thousands 

of bones may still lie on the silty floor. Lemurs 
are unique to the island country, and the 


government announced on 
13 January. Last year, a cross- 
party group of politicians 
complained that costs had 
increased at the site and that 
targets for the reprocessing 
of waste had been missed. 
Costs on one section of the 
repository alone increased 

by more than £300 million 
(US$450 million) between 
March 2012 and September 
2013, while estimated 
completion dates on another 
project were put back by 
years. Control of the company 
set up to manage the clean-up 
will now switch from the 
private consortium to the UK 
Nuclear Decommissioning 
Authority. 
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Philae still missing 
The search for Philae, the 
robotic lander that hada 
bouncy touchdown ona comet 
on 12 November, has become 
more difficult. The European 
Space Agency lost contact with 
the probe after it came to rest 
in a shaded spot away from its 
planned landing site and went 
into hibernation after its solar- 
powered batteries ran out. 
Rosetta, the spacecraft that 
deployed Philae, is currently 
in a higher orbit around 

comet 67P/Churyumov- 
Gerasimenko than it was in 
December, meaning that its 
photographs of the comet's 


anthropologists estimate that two-thirds 

of the species that existed only a thousand 

year's ago are now extinct. The cave, which 

can only be reached by skilled divers, also 
contains remains of other large extinct animals, 
including elephant birds (Mullerornis), giant 
fossa (Cryptoprocta spelea), horned crocodiles 
(Voay robustus, pictured) and pygmy hippos 
(Hippopotamus lemerlei). The team hopes to 
return to retrieve more specimens this summer. 


surface are fuzzier. Mission 
scientists hope to perform a 
fly-by, which would afford 
Rosetta a better view. 


SpaceX rocket crash 


The first attempt to landa 
reusable rocket on a boat 

has ended ina crash. The 
private company SpaceX, 

of Hawthorne, California, 
successfully launched an 
unmanned resupply ship to 
the International Space Station 
on 10 January — but failed 

to land the first stage of its 
Falcon 9 rocket on a floating 
platform in the Atlantic Ocean 
as planned because the fins 
that helped to stabilize its 
descent ran out of hydraulic 
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fluid. SpaceX is one of two 
companies tasked with flying 
cargo to the space station for 
NASA. Among other supplies, 
the ship carried an instrument 
to measure particulates in the 
atmosphere (see Nature http:// 
doi.org/x83; 2014). 


Parasite in decline 


A global campaign to eradicate 
guinea-worm disease is getting 
closer to its goal. The Carter 
Center, a medical charity in 
Atlanta, Georgia, announced 
on 12 January that only 

126 cases of the disease were 
reported in 2014, a 15% drop 
from the previous year. All of 
the 2014 cases occurred in one 
of four African countries. The 
majority — 70 cases — were 

in South Sudan. The Carter 
Center said that when it first 
launched its campaign in 

1986, 3.5 million people were 
afflicted with the tropical 
disease every year in Africa 
and Asia. 


Policy leader dies 
Hubert Markl (pictured), one 
of the most influential figures 
in German research policy, died 
on 8 January, aged 76. Markl 
was president of the DFG, West 
Germany’s major funding 
agency for basic research, 
during the reunification with 
East Germany in 1990. From 
1996 to 2002, he led the Max 
Planck Society, a leading 


TREND WATCH 


Global investment in clean-energy 


projects and research surged in 


2014, according to figures released 


on 9 January by Bloomberg New 
Energy Finance. Almost half of 
spending was on solar power, 
which rose by 25% from 2013 to 
US$149.6 billion. Clean-energy 
investment in China, the sector's 
leading spender, grew by 32% to 
arecord $89.5 billion, whereas 
spending in Europe rose by only 
1% to $66 billion. The recent 
slump in the prices of oil and 
natural gas is expected to slow 
global investment in renewables. 


research organization, through 
a modernization effort. He 
closed poorly performing 
institutes in the country’s west 
and opened new ones in the 
east, often with prominent 
foreign scientists as directors. 
Markl also launched an 
independent study of the 
society's activities during the 
Third Reich and formally 
apologized for the society’s 
Nazi-era crimes. 


Ebola vaccine trials 
Clinical trials of Ebola 
vaccines could begin in West 
Africa within a few weeks, 
officials of the World Health 
Organization in Geneva said 
after a high-level meeting 

on 8 January. The trials will 
test the efficacy of up to 

three different vaccines at 
preventing infection in people 
in Guinea, Liberia and Sierra 
Leone. On 12 January, the 


Wellcome Trust, a London- 
based biomedical-research 
charity that is involved in the 
trials, released a ‘road map on 
Ebola vaccines, emphasizing 
the need to involve local 
communities in organizing the 
trials. 


Stimulating Japan 


The Japanese government 
approved a ¥3.1-trillion 
(US$26-billion) 
supplementary budget 

on 9 January, to jolt the 
economy. The government 
will spend ¥351.5 billion 

to boost energy efficiency 

in the industrial sector and 
¥5.2 billion on research related 
to developing energy sources. 
The package reportedly 
included ¥1.2 billion for efforts 
to develop next-generation 
power semiconductors at 
Nagoya University. Another 
¥3.7 billion will support 
increased monitoring of 
Japan’s volcanoes. 


The personal-genetics start-up 
23andMe of Mountain View, 
California, has entered the 
first two of what it promises 
will bea series of agreements 
with major pharmaceutical 
companies. On 6 January, 
Genentech of South San 
Francisco, California, 
disclosed that it will pay up to 
US$60 million to access part of 
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Global spending on renewables grew by 16% in 2014, boosted 


by strong growth in Asia. 
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23andMe’s database containing 
the genome sequences of 
3,000 people with Parkinson's 
disease. And on 12 January, 
New York City-based Pfizer 
announced a broad agreement 
to access the collected genetic 
information of 800,000 
23andMe customers. Both 
deals were seen as motivated 
by the companies’ interest 

not only in 23andMe’ genetic 
data, but also in the personal 
data that its customers share 
with 23andMe, which could 
accelerate and improve clinical 
trials. 


Pharma deal 


Swiss pharmaceutical 

giant Roche announced on 
12 January that it will spend 
more than US$1 billion to 
purchase more than half 
the shares of biotechnology 
firm Foundation Medicine. 
The firm, in Cambridge, 
Massachusetts, generates 
genetic profiles of cancer 
tumours with the aim of 
finding treatments that can 
be personalized to patients. 
Roche, headquartered in 
Basel, says that it wants to 
sell Foundation’s DNA tests 
outside the United States, 
and will work with the firm 
to create tests that use blood 
samples, rather than a tissue 
biopsy. 


Diversity push 


Computer-chip giant Intel 

on 6 January pledged to 
spend US$300 million by 
2020 to boost the diversity 

of the technology industry's 
workforce. The company 
plans to invest in recruiting 
and retaining engineers 

and computer scientists 

who are women or from 
under-represented ethnic 
groups and to expand its 
educational programmes at 
schools and universities. Chief 
executive Brian Krzanich said 
that Intel aims to raise the 
representation of women and 
minorities at the company, in 
part to better cater to Intel’s 
diverse customer base. 


> NATURE.COM 
For daily news updates see: 
WWW.nature.com/news 


15 JANUARY 2015 | VOL 517 | NATURE | 249 
© 2015 Macmillan Publishers Limited. All rights reserved 


UESLE!| MARCELINO/REUTERS/CORBIS 


NEWSIN FQCUS 


Cats catch up Digital ‘GoreSat’ The bacteria 
to dogs in the genome- island built for eco- resurrected after 14 years that serve as live-in 
sequencing race p.252 experimentation p.254 in limbo p.256 light bulbs p.262 


7 


| ENVIRONMENT | 


Brazilian president Dilma Rousseff (left) has chosen climate-change sceptic Aldo Rebelo (right) to lead her science ministry. 


Political appointments spur 
concerns for Amazon 


Environmentalists worried after Brazilian president picks ministers with ties to agriculture lobby. 


BY JEFF TOLLEFSON 


razilian president Dilma Rousseff sur- 
B vived an unexpectedly strong challenge 
from a prominent environmentalist to 
stay in office. Now, less than three months into 
her second term, Rousseff has sparked contro- 
versy by appointing an avowed climate-change 
sceptic as science minister. 
The choice of Aldo Rebelo, a Communist 
law-maker who last July called the idea of 
global warming “incompatible with current 


knowledge’, follows a decade in which Brazil 
has slowed Amazon deforestation and claimed 
a leadership role in international climate pol- 
icy. Rebelo promised to uphold the govern- 
ment’s climate stance when he assumed his 
post at the Ministry of Science, Technology 
and Innovation on 2 January, but his appoint- 
ment has stirred fears that Brazil may backslide 
on environmental protection. 

“In general, scientists are sceptical and 
disappointed,’ says Jean Ometto, coordina- 
tor of the Earth System Science Centre at the 


government’s National Institute for Space 
Research in Sao Jose dos Campos. “But the sci- 
entific centre is strong, and there’s going to be 
pressure from the scientific community not to 
change the way that climate change is governed” 
Rebelo earned his reputation as a skilful pol- 
itician in the lower house of Brazil’s National 
Congress, and he played a leading part when 
the ‘ruralistas — a voting bloc representing 
rural agriculture — weakened protections 
for Brazil's forests in 2012. Most scientists say 
that Rebelo largely ignored the scientific 
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> community during that debate, although 
some think that he may move to the politi- 
cal centre as minister of science. 

Ina written statement provided to Nature, 
Rebelo dismissed concerns about his posi- 
tion on global warming. “The debate about 
climate change exists independent of my 
opinion,’ he said. More broadly, Rebelo 
said that his duty as minister of science is to 
listen to — and fight for — the academic and 
scientific community in Brazil. 

He has limited power to influence cli- 
mate policy, although the ministry does 
oversee some national assessments and 
some climate reports to the United Nations. 
One area that will receive considerable 
attention is the ministry’s funding for sci- 
ence, which has been squeezed by compet- 
ing demands in recent years. 

But Rebelo is not the only appointment 
to raise eyebrows. Rousseff named Katia 
Abreu, who fought environmentalists as a 
senator and agricultural lobbyist, to head 
Brazil’s agriculture ministry. And both 
appointments come at a sensitive time: 
although the rate of Amazon deforestation 
has dropped by 82% since 2004, the pace of 
land clearing spiked sharply in 2013 — and 
it seems to be on the rise once again. 

Rousseff has never had a good relation- 
ship with environmentalists, who say that 
she is hiding behind Brazil's relative suc- 
cess in reducing deforestation while push- 
ing for major dams and ports that will only 
increase pressure on forests. Now, after a 
bruising election, she is looking to rebuild 
support in Congress in the middle of cor- 
ruption scandals that have tainted her 
party. 

The Communist party, of which Rebelo 
is a member, has been a “small but loyal” 
member of the governing coalition led by 
Rousseff’s Workers’ Party, and it is also 
friendly with the agricultural caucus, says 
Steve Schwartzman, an anthropologist 
with the Environmental Defense Fund in 
Washington DC, who has worked in Brazil 
for decades. 

One key to Brazil’s success in curbing 
deforestation and carbon emissions over 
the past decade was the involvement of 
multiple agencies, but that kind of coor- 
dinated approach could be more difficult 
today, Schwartzman says. “These minis- 
terial appointments cast a lot of doubt on 
what we can expect from this government.” 

Helena Nader, president of the Brazilian 
Society for the Advancement of Science, 
takes a more conciliatory approach: she says 
that scientists should give Rebelo some time 
at the ministry before judging him. More 
broadly, Nader says that she isn’t expecting 
a major backslide on environmental issues 
by the new Rousseff government. 

“Nevertheless,” she says, “only time 
will show. 
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The cat genome is out of the bag, and has already helped to pinpoint a gene involved in kidney disease. 


‘I can haz 


genomes, 


too’ 


Cats sparked an Internet meme complete with its own 
grammar, but dogs have dominated genetics labs — untilnow. 


BY EWEN CALLAWAY 


( } ats may have beaten dogs on the Internet 
but felines have been a rare breed in 
genetics labs compared with their 

canine counterparts. Now, at last, cats are 

clawing their way into genomics. 

Ata meeting this week in San Diego, Califor- 
nia, a close-knit group of geneticists unveiled 
the first results from an effort to sequence the 
genomes of 99 domestic cats. The work will 
benefit both humans and felines, the research- 
ers say, by mapping the mutations underlying 
conditions that afflict the two species, such as 
kidney disease. 

“It’s a great time to be in cat genomics,” 
says William Murphy, a geneticist at Texas 
A&M University in College Station who is 
involved in the effort. Plummeting costs for 
DNA sequencing now make it possible to do 
genomics cheaply — and cat genomics, long 
under-funded compared with similar efforts 
in dogs, is benefiting, he says. “We're finally at 
the point where we can do all sorts of things we 
wanted to do 5 or 10 years ago.” 

The first cat genome sequence’ — from an 


2015 


© 2015 Macmillan Publishers Limited. All rights reserved 


Abyssinian named Cinnamon — was reported 
in 2007. But the sequence contained significant 
gaps and errors, which slowed efforts to map 
genes. A high-quality version of Cinnamon’s 
genome was not published? until late 2014. 
Domestic dogs, meanwhile, have become a 
darling of geneticists: their full genome’ was 
reported in 2005, and the sequence has been 
continually improved. Hundreds of genes 
underlying canine diseases and traits are esti- 
mated to have been discovered, compared with 
as few as a dozen for cats. 

The discrepancy can be traced back to 
the early 2000s. After the completion of the 
human, mouse and rat genomes, the US 
National Institutes of Health organized a com- 
mission to decide on their next target; the dog 
genome was selected for high-quality sequenc- 
ing, whereas cats were put on hold. 

That got some cat geneticists’ backs up. “The 
truth is there were more powerful people inter- 
ested in dogs,’ says Stephen O’Brien, director 
of the Theodosius Dobzhansky Center for 
Genome Bioinformatics in St Petersburg, Rus- 
sia, who led the initial cat-sequencing efforts. 

But canine researchers were able to make a 


WES FERNANDES 


compelling case. Pet dogs suffer from many 
of the same conditions as humans, from 
narcolepsy to arthritis. And the intensely 
inbred nature of dog breeds made it relatively 
easy to identify disease-causing genes: because 
there is little genetic variation within any par- 
ticular breed, the genes that cause disease in 
affected individuals stand out. 

Dogs had other advantages, too. The exist- 
ence of kennel clubs, which maintain “breed 
standards and are full of enthusiastic pet 
owners and veterinary surgeons, helped dog 
geneticists to recruit subjects for study. “Given 
the resources they had, they were discovering 
new genetic diseases in breeds almost daily,” 
says Niels Pedersen, a veterinary scientist at the 
University of California, Davis. 

In fact, both cats and dogs offer insights into 
human disease, including those associated 
with old age. In 2004, a team led by geneti- 
cist Leslie Lyons of the University of Missouri 
in Columbia (and owner of two female cats, 
Withers and Figaro) discovered that mutations 
that cause polycystic kidney disease — a major 
cause of renal failure in older individuals — 
occur in the same gene in humans and cats’. 
Cat versions of type 2 diabetes, asthma, retinal 
atrophy and numerous other conditions have 
close similarities to human disease. Cats can 
also become infected with a virus that is closely 
related to HIV and experience symptoms simi- 
lar to those of people with AIDS. 

In the hopes of speeding up the discovery 


of genes related to these conditions last year, 
Lyons launched the 99 Lives cat genome 
sequencing initiative, playfully hosted on a 
site called Lyons’ Den. She discussed the effort 
on 11 January at the Plant & Animal Genome 
conference in San Diego. 

Lyons’ team is cobbling together funding 
from anywhere it can find it. The research- 
ers are asking private owners, breeders and 

even pet-food com- 


“Twould love panies to donate the 
toeradicateall —_US$7,500 needed to 
genetic disease sequence the genome 
in cat breeds ofa single cat, which 
before we’re could be one of a 
done.” donor's choice. “Any 


cat can participate. 
Any owner can participate,’ she says. All the 
data will be made public after the results are 
published. 

With the money raised so far, the team has 
sequenced the genomes of 56 cats, including 
fancy breeds such as Burmese; cats with spe- 
cific diseases; and a kitten named Dragon and 
his parents Ares and Marcus — the hope is to 
use the feline trio to narrow down the genetic 
basis for traits they share, such as their silver, 
curly coats. 

Even Robert Wayne, a canine geneticist 
at the University of California, Los Angeles, 
agrees that Lyons effort is important. “I hope 
she raises money for it,” he says. 

Insights from cat genomics extend beyond 
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disease. Razib Khan, an evolutionary geneticist 
at the University of California, Davis, wants to 
use genome sequences to chart the domestica- 
tion and spread of cats throughout the world, 
and to determine how different domestic and 
wild cats are genetically. “There’s always the 
question — are they domesticated at all?” he 
says. The 2014 publication that included Cin- 
namon’s genome already identified differences 
between domestic and wild cats, including 
genes expressed in the brain that are possibly 
linked to the docility of (some) house cats. 
“Wild cats will hand you your behind if you 
get next to them and domestic cats will sit on 
your lap,” O’Brien notes. 

Lyons is also keen to see genomics help 
felines. “I would love to eradicate all genetic 
disease in cat breeds before were done,’ she 
says. Her team’s discovery of the cause of poly- 
cystic kidney disease has reduced its preva- 
lence among Persians, by removing cats with 
the mutation from the breeding pool. Her lab 
is now developing drugs that could treat the 
terminal condition in cats — and perhaps in 
humans. But human health, Pedersen says, is 
not the only goal. “I’m in it, and Leslie’s in it, 
for the good of cats.” = 
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First biosimilar drug 
set to enter US market 


But such cheaper, generic versions of biological drugs face 
scientific, regulatory and patent hurdles. 


BY HEIDI LEDFORD 


fter years of debate, the US Food and 
A™ Administration (FDA) is poised 
to allow the sale of biosimilars, cheaper 
versions of complex and expensive biological 
drugs used to treat conditions such as cancer 
and autoimmune diseases. On 7 January, an 
FDA advisory panel decided unanimously 
that a drug made by Sandoz, the generics arm 
of Swiss pharmaceutical giant Novartis, should 
be accepted as a replacement for filgrastim 
(Neupogen), an immune-boosting drug for 
people undergoing cancer treatment made by 
Amgen of Thousand Oaks, California. 
Such knock-offs are called biosimilars 


because the drugs that they mimic, dubbed 
biologics, consist of complicated molecules 
that are made in living cells and are impos- 
sible to copy exactly. Even copying them inex- 
actly is immensely challenging — despite the 
expected approval of the Sandoz drug, the 
difficulties involved in creating and evaluat- 
ing biosimilars may limit their infiltration 
of the marketplace. The field is also littered 
with patent issues, especially with regard to 
how the drug is manufactured. In the case of 
filgrastim, Sandoz is challenging some of the 
legal requirements for approval. 

“We're starting from scratch,’ says Jordan 
Paradise, a specialist in technology law at Seton 
Hall University in Newark, New Jersey. “A lot of 


the scientific uncertainty is still there” 

Unlike typical drugs, which are relatively 
small molecules made through biochemical 
processes, biologics are large protein molecules 
produced by genetically engineered organ- 
isms. Living cells may chemically modify the 
proteins they make by adding complex sugars 
and other compounds at certain positions. The 
exact conditions under which cells are grown 
can alter the pattern of these modifications, and 
thus the molecule’s structure and behaviour. 
The result is a drug so complex that it is diffi- 
cult — if not impossible — to fully characterize. 

Because biosimilars are inexact copies, they 
are required to undergo more testing than an 
ordinary generic drug. The European Union 
has been evaluating and approving biosimi- 
lars for the past decade, but the United States 
did not have a way to do so until regulatory 
legislation was passed in 2010. Biotechnology 
companies have been waiting to find out how 
the FDA would evaluate the drugs. 

Patient advocates hope that biosimilars 
can reduce drug costs by increasing compe- 
tition. Biologics are expensive: researchers 
have calculated that treatment of metastatic 
colorectal cancer with bevacizumab (Avastin) 
costs about US$75,000 per year of life gained 
(V. Shankaran et al. Oncologist 19, 892-899; 
2014). A report last year by the RAND P 
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> Corporation in Santa Monica, California, 
estimated that biosimilars could save $44.2 bil- 
lion by 2024. 

Filgrastim is relatively simple: it is a small pro- 
tein with no attached sugars. Even so, Sandoz 
presented the FDA with clinical-trial data from 
388 people with breast cancer and 174 healthy 
participants to show that its biosimilar breaks 
down in the body similarly to the original, and 
does not provoke an immune response. 

The FDA is expected to make a final decision 
by May. But even as Sandoz prepares to sell its 
drug in the United States, it is embroiled in a 
patent fight with Amgen. By US law, Sandoz 
had to reveal the details of its manufacturing 
method to Amgen — a provision not present 
in Europe — so that Amgen could determine 
whether any of its patents had been violated. 
Sandoz refused. That is a disheartening prec- 
edent, says Paradise. “Here we've got the first 
biosimilar application and we've already got the 


manufacturers not working together.” 

Such concerns loom large among manu- 
facturers, says Nicholson Price, a patent-law 
specialist at the University of New Hamp- 

shire School of Law 


“We’ve got the in Concord. Drug 
first biosimilar —_fitms often keep 
application their manufacturing 
and we’ve methods confiden- 
already got the tial, and production 
manufacturers of complex drugs 
not working gives them ample 


opportunity to file 
patents on manufac- 
turing techniques or 
ways of characterizing molecules. “The sec- 
ond company is attempting to feel its way in 
the dark to what the first company has done,” 
says Price. “I suspect there are other biosimi- 
lars that are being deterred either by specific 
patents or just the worry that there may be 


together.” 


patents lurking out there that they don’t know 
about.” 

Even when a biosimilar makes it over these 
hurdles, it is unclear how consumers will react to 
a drug that is almost, but not quite, a copy of the 
original. At the advisory-committee meeting, a 
number of patient groups expressed support for 
biosimilars and the promise of relief from high 
drug prices. But they voiced concerns that bio- 
similars would be given the same generic names 
as the drugs they were meant to replace, creating 
confusion as to whether recipients were getting 
the original or the copy. Many will be watching 
the Sandoz drug’s approval to see whether the 
FDA will let it be called filgrastim. 

Committee member James Liebmann, an 
oncologist at the University of Massachusetts 
in Worcester, reacted to that concern with sur- 
prise. “This has been pretty clearly shown to be 
filgrastim,” he said. “To name it anything else 
would be misleading” = 


MAXUSER/SHUTTERSTOCK 


MICROSCOPY 


Inflated brains show nano detail 


Diaper material expands tissue, enabling ordinary microscopes to reveal nanoscale features. 


BY EWEN CALLAWAY 


n innovative method could enable 
Aes to image an entire brain in 

exquisite molecular detail using an 
ordinary microscope. 

The technique, called expansion micros- 
copy, involves physically inflating biological 
tissues using a material more commonly found 
in baby nappies, or diapers. Edward Boyden, a 
neuroengineer at the Massachusetts Institute 
of Technology (MIT) in Cambridge, discussed 
the technique, which he developed with his 
MIT colleagues Fei Chen and Paul Tillberg, at 
a conference last month. 

Conventional optical microscopes cannot 
distinguish objects that are closer together 
than about 200 nanometres. Although super- 
resolution microscopy can discern objects as 
close together as about 20 nm, they require 
expensive, specialized equipment, and struggle 
with thick structures such as sections of brains. 


> 


MORE 
ONLINE 


“What we've been trying to do is figure out if 
we can make everything bigger,’ Boyden told 
the meeting at the US National Institutes of 
Health (NIH) in Bethesda, Maryland. To do 
this, his team used a chemical called sodium 
acrylate, which has two useful properties: it 
can form a dense mesh that holds proteins in 
place, and it swells in the presence of water. 
The acrylate is the same substance that gives 
nappies their sponginess. When inflated, 
Boyden’s tissues grow by a factor of about 4.5 
in each dimension. 

Before swelling, the tissue is treated with a 
chemical cocktail that makes it transparent, 
and then with fluorescent molecules that anchor 
specific proteins to the acrylate, which is then 
infused into the tissue. Just as with nappies, 
adding water causes the acrylate to swell. After 
stretching, the fluorescent-tagged molecules 
move farther away from each other; proteins 
that were previously too close to distinguish 
with a visible-light microscope come into crisp 
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focus. In his presentation, Boyden suggested 
that the technique can resolve molecules that 
are as close as 60 nm before expansion. 

Crucially, the process generally maintains 
the relative orientation and interconnection 
of proteins and keeps other cellular structures 
intact: it distorts the relative position of pro- 
teins by just 1-4%, Boyden’s team calculated. 

Viviana Gradinaru, a neuroscientist at the 
California Institute of Technology in Pasadena, 
says that Boyden’s technique is another exam- 
ple of how scientists are bypassing hardware 
limitations by modifying biological tissue. 

“This is certainly highly ingenious, but how 
much practical use it will be is less clear,” notes 
Guy Cox, a microscopy specialist at the Uni- 
versity of Sydney, Australia. “If this is to be of 
any serious use, I suspect it will be in collabora- 
tion with existing super-resolution techniques 
on small macromolecular complexes, to push 
the boundaries a bit further, rather than look- 
ing at whole cells.” = 
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Moorea is one of the most studied ecosystems in the world. Myriad data collected over four decades will be used to 
build a digital replica of the island that includes its varied geography, its climate and all of its plant and animal life. 


Social science 
Government census data will be 
combined with tourist numbers, 
employment status and 

economic revenue. 


Watery secrets 

Continual sampling of the 
water's temperature, salinity, pH 
and microbial diversity at sites 
around the island, including the 
Tiahura Marine Protected Area, 
shown here. 


ENVIRONMENT 


Catalogue of life 


populations, including numbers 
and species composition. 


DNA barcodes for every species 
more than 1 millimetre in length. | 


, Ft - = 


Peaks and valleys 
Landscape model to 70- 
centimetre resolution created 
from satellite images. 


“ : 
©) Ocean circulation i 
$ Time-series measurements of 
= currents, waves and water 
>) properties from an underwater 

~~ sensor array around the island. 


=U 
* . 
_ Underwater terrain 


Sea floor mapped using satellite 
imagery and sonar data from 
ships. Will have a resolution 
ranging from 0.5 metres in the 
shallows to 20 metres in the 
deep ocean. 


Tropical paradise inspires 
virtual ecology lab 


Digital version of Moorea will provide a way to experiment with an entire ecosystem. 


BY DANIEL CRESSEY 


paradise on Earth could soon become 
the first ecosystem in the world to 
be replicated in digital form in pain- 
staking detail, from the genes of its plants and 
animals to the geography of its landscape. 
An international team is preparing to create 
a digital avatar of the Pacific island of Moorea, 
which lies off the coast of Tahiti and is part of 
French Polynesia. Moorea is already one of the 
most studied islands in the world; the team plans 
to turn those data into a virtual lab that would 
allow scientists to test and generate hypotheses 
about the impact of human activities. 
Ecologists have used models for years to 
tease out the relationships between different 
facets of nature, such as temperature and pop- 
ulation or predators and prey. But much of that 
modelling is relevant only to specific species or 


research questions, and some scientists want a 
holistic view. As human activity and natural 
variations combine to alter the environment, 
researchers need to know how mitigating 
steps — such as setting up protected areas, or 
attempts to curb fossil-fuel use — might affect 
an entire ecosystem. 

“We know the world’s changing. Yet the 
decisions we're making, we're making them in 
the dark,” says Neil Davies, one of the people 
behind the Moorea IDEA (Island Digital Eco- 
system Avatars) project and director of Gump 
Station, the University of California, Berkeley’s 
marine-science base on the island. “We're not 
going to have precise predictions ever, but 
we need to have a way of modelling different 
scenarios.” For example, if a hotel is built at 
a certain location, how does that change the 
ecosystem? Ifa species disappears from a river, 
what happens downstream? 


Mooreais an ideal place to start, says Davies, 
because the island is about 16 kilometres 
across and has just 17,000 people living on it, 
making it easier to model than larger ecosys- 
tems and those that are more connected to the 
rest of the world. In addition, French research- 
ers have been there since the 1970s, and Gump 
Station has been operating since the 1980s. 
Both efforts have collected myriad data on 
the island’s waters, with decades-long studies 
of coral and fish numbers (see “Data heaver). 

These traditional surveys of marine life are 
now being linked up with the Moorea Biocode 
Project, which aims to characterize every spe- 
cies larger than a millimetre in length on the 
island and allocate them a ‘DNA barcode’ — 
snippets of DNA that can be used as a unique 
identifier. Species can thus be identified 
quickly and easily even when they are in places 
or states that would otherwise be difficult > 
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> to recognize, such as in the contents of 
another organism's stomach, or in seed or 
larval form. 

The avatar would combine insights 
gleaned from the Biocode project — such as 
which species are present at certain ocean 
spots, or which species are eaten by another 
— with data on weather, ocean currents 
and society such as population density and 
real-estate prices. It would provide a three- 
dimensional visualization of the island 
and its surrounding waters that might look 
something like those on Google Earth, but 
would enable researchers to zoom into a 
location, access data and run simulations. 

“The first stage will be a framework to 
integrate the data we have. To collate them, 
combine them, and to make the data acces- 
sible to scientists,” says project member 
Matthias Troyer, a computer scientist at 
the Swiss Federal Institute of Technology 
in Zurich. “Then, based on that, one can 
start on modelling.” 


EXPANDING PROJECT 
The IDEA project was born in 2013, the 
brainchild of Davies, Troyer and three 
other marine scientists: Dawn Field at the 
University of Oxford, UK; Sally Holbrook at 
the University of California, Santa Barbara; 
and Serge Planes from the French research 
base on Moorea. The consortium now has 
more than 80 participants. 

At meetings late last year, the IDEA team 
discussed how to combine existing data 
with those coming from the latest tech- 
nologies. Some of the framework for the 
avatar is already under construction, and 
Davies says that the team is seeking funding 
of around US$5 million over three years to 
pursue a pilot project. 

The project is “really novel in the mod- 
elling community’; says Mike Harfoot, an 
ecosystem modeller at the United Nations 
Environment Programme's World Con- 
servation Moni- 
toring Centre in 


Cpt. a 
Cambridge, UK, It’s impressive 
Pee the amount of 
because it will inte- 
data going 


grate societal data 
with physical and 
biological compo- 
nents. And, he adds, the computational 
power required to take a holistic approach 
to modelling ecosystems has only recently 
become available. 

“It’s impressive the amount of data that’s 
going in it,” says Rick Stafford, a computa- 
tional ecologist at Bournemouth University, 
UK. Getting the different data sets to talk to 
each other will be a challenge, but the time 
is ripe for such an ambitious undertaking, 
says Davies. And it if works on Moorea, the 
approach could be rolled out to other parts 
of the world. Although ambitious, says 
Davies, “it’s not a pipe dream’. m 


into it.” 
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The DSCOVR craft is removed from a container in Cape Canaveral, Florida. 


Mothballed NASA 
craft to launch 


Proposed by former US vice-president Al Gore in 1998 to 
image Earth, DSCOVR probe will monitor space weather. 


BY MARK ZASTROW 


fter nearly 14 years in limbo, an Earth- 
Aussie spacecraft built by NASA 
is finally set to fly. 

The Deep Space Climate Observatory 
(DSCOVR), scheduled to launch as soon as 
29 January, will constantly observe Earth's 
sunlit side from a distance of 1.5 million kilo- 
metres. It will track daily weather patterns and 
seasonal vegetation changes, monitor atmos- 
pheric pollution and make the most precise 
measurements yet of how much energy Earth 
throws out into space — crucial data for the 
improvement of global climate models. 

DSCOVRs resurrection is thanks to renewed 
interest in what was originally its secondary 
mission: monitoring space weather. From a 
point between the Sun and Earth at which the 
bodies’ gravitational pulls cancel out, the probe 
will be able to detect approaching solar storms 
— bursts of charged particles and powerful 
radiation that pose a threat to astronauts, orbit- 
ing satellites and power grids on the ground. 
Such storms are of interest to the US Air Force, 
which is funding the satellite’s launch, and the 
National Oceanic and Atmospheric Adminis- 
tration (NOAA), which will operate it. 

DSCOVR was the brainchild of former 
US vice-president Al Gore. He imagined a probe 
that would beam down a live image of Earth's 
illuminated side that could be available online. 
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Just as the famous “blue marble’ image of Earth 
taken by the Apollo 17 crew had inspired people, 
Gore said in 1998 that DS>COVR would “awaken 
a new generation to the environment and edu- 
cate millions of children around the globe”. 

To Gore’s critics in Congress — particularly 
Republicans sceptical of his environmental 
advocacy — those were fighting words. The 
mission was nothing more than “a multi- 
million-dollar screen saver’, said representa- 
tive Dave Weldon (Republican, Florida). It 
acquired the unflattering nickname ‘GoreSat. 

“The worst thing that can happen to science 
is to get mixed up in politics,’ says Francisco 
Valero, a retired climate scientist who was at the 
Scripps Institution of Oceanography in La Jolla, 
California, and led the satellite's original Earth- 
science team. “That is what happened to us.” 

Although Gore intended the project to 
be mostly educational, NASA formulated a 
complementary science mission by soliciting 
proposals from the community. Valero’s win- 
ning pitch was a probe to measure how much 
radiation Earth reflects back into space, a cru- 
cial variable for untangling the web of processes 
that influence the planet's climate. Clouds, for 
example, are a perennial conundrum for climate 
models because they both reflect incoming sun- 
light and trap outgoing heat. Valero proposed 
two instruments: a camera called EPIC to image 
clouds and other climate-influencing factors 
such as pollution, volcanic ash and seasonal 
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vegetation; and a radiometer named NISTAR 
to measure the energy coming from Earth. 
Current estimates of the planet’s energy 
balance rely on stitching together strips of 
data from orbiting satellites, but DSCOVR will 
observe the entire sunlit side of Earth. It should 
reduce errors in estimates of Earth’s radia- 
tion budget to 1.5% — a more-than-twofold 
improvement on present measures, says climate 
scientist Patrick Minnis of NASA’s Langley 
Research Center in Hampton, Virginia, who 
now leads this aspect of the mission. It will not 
be the final word, says Adam Szabo, the mission's 
project scientist at NASA's Goddard Space Flight 
Center in Greenbelt, Maryland, but DS>COVR 
will bea big help to climate simulations. And it 
will partly fulfil Gore's original vision by posting 
snapshots of Earth online every few hours. 


POLITICAL SCIENCE 

Although it blew past its original US$50-million 
budget to roughly $100 million, DSCOVR was 
still built relatively quickly and cheaply. NASA 
completed construction in 2000, intending to 
launch the craft on the space shuttle. 

In January 2001, George W. Bush became 
president after defeating Gore in a controver- 
sial election. Soon after, the mission was taken 
off NASAs shuttle flight manifest. The official 
reason was that construction of the chronically 
delayed and over-budget International Space 


Station required a higher priority. 

But that did not stop speculation about 
political motives. ‘Who killed DSCOVR?’ 
became something ofa parlour game in space 


circles. Mitchell 
“The worst thing Anderson, a reporter 
that canhappen _ forthe climate website 
to science is to DeSmogBlog in Van- 
getmixedupin _couver, Canada, cited 
politics.” an unnamed NASA 


source who said that 
Bush's vice-president Dick Cheney had given 
the order; others suggest that it was the presi- 
dent himself. 

In reality, the space shuttle’s crowded 
launch schedule was the biggest obstacle, says 
Ghassem Asrar, who was the head of NASA's 
Earth-science division when the decision was 
made. But the project had become “tainted”, 
he adds, preventing public support from pri- 
vately sympathetic politicians and from NASA 
itself. “It would be dishonest to say the politics 
of climate science wasn't a factor. It was.” 

In November 2001, with no launch slot 
in sight, Congress approved $1 million for 
DSCOVR to be put into storage at Goddard. 
And there it might have remained were it not 
for interest from space-weather forecasters 
at NOAA and in the Air Force. In 2008, they 
were looking to cheaply replace NASAs ageing 
Advanced Composition Explorer, which had 
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been informing forecasts from the same spot in 
space that DSCOVR was supposed to occupy. In 
October that year, Congress ordered NASA to 
come up witha plan for DSCOVR’ revival, and 
after a series of tests, it began funding NOAA to 
refurbish and operate the craft with $105 mil- 
lion over five years. 

For Jay Herman, an atmospheric scientist at 
Goddard and EPIC instrument scientist, the 
delay has a silver lining: the refurbishment 
revealed a manufacturing defect in EPIC that 
would have let in stray light and potentially 
ruined its image of Earth. The delay allowed 
enough time to study the problem and correct 
for it. “So in some ways,” says Herman, “I’m 
very glad it did not fly 14 years ago. Because it 
might have been embarrassing.” m 


CORRECTION & CLARIFICATION 

In the story about Suzanne Topalian in 
‘Nature’s 10’ (Nature 516, 311-319; 2014), 
the text wrongly noted that the July approval 
for the drug she’d been involved with was in 
the United States — it was in Japan. 

The News Feature ‘Pollution patrol’ (Nature 
517, 136-138; 2015) quoted Joshua Apte 
as saying that air pollution is the largest 
global health risk. What he meant to say was 
that it is the largest environmental health risk. 
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Biochar —a soil additive made by heating biological material — is catching attention as a means to improve crop growth and clean up contaminated water. 


STATE-OF-THE-ART SOIL 


Acharcoal-rich product called biochar could boost agricultural yields and 
control pollution. Scientists are putting the trendy substance to the test. 


BY RACHEL CERNANSKY 


or more than 150 years, the Brooklyn Navy Yard 

constructed vessels that helped to stop the slave trade 

from Africa, lay the first undersea telegraph cable 

and end the Second World War. Now, this sprawling 
industrial facility in New York City is filled with artists, 
architects, producers of artisanal moonshine and people 
growing organic vegetables. On a drizzly day in autumn, 
Ben Flanner tends a sea of red and green lettuce on a 
6,000-square-metre rooftop farm. 

The soil beneath the plants looks ordinary, but Flanner 
grabs a handful and holds it up for inspection. Amid the 
brown clods of dirt are small black particles — remnants of 
charcoal fragments that were mixed into the soil two years 
ago. Flanner thinks that this carbon-rich material, known 
as biochar, has helped the crops to thrive, possibly even 
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increasing their yield, and he hopes for more impressive 
results over the next few years. 

Across the United States, sales of this long-lasting soil 
additive have surged over the past few years, tripling annually 
since 2008, according to some estimates. The Biochar Com- 
pany in Berwyn, Pennsylvania — which supplied Flanner’s 
Brooklyn farm — sells it both wholesale and direct to con- 
sumers, through outlets including Amazon and some Whole 
Foods stores. And countries ranging from China to Sweden 
are using biochar on agricultural fields and city lawns. 

Proponents see big potential for the soil enhancer, which 
is produced by heating biological material — such as husks 
and other agricultural waste — in a low-oxygen chamber. 
Biochar can be made as a by-product of biofuel generation, 
so some companies are hoping to cash in on both products 
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as demand grows for greener forms of energy. 

Interest in biochar is also growing among 
scientists, who are quickly ramping up stud- 
ies to test its potential. They are particularly 
interested in how the chemical and physical 
properties of biochar particles affect water 
moving through soil, remove pollutants, alter 
microbial communities and reduce emissions 
of greenhouse gases. The hope is that biochar 
can help farmers around the world, particu- 
larly those in Africa and other developing 
regions, who often struggle with poor soils. 

Johannes Lehmann, a crop and soil scientist 
at Cornell University in Ithaca, New York, says 
that different types of biochar “have unique 
potential to mitigate some of the greatest soil- 
health constraints to crop productivity — for 
example, in highly weathered and sandy soils”. 

But there are still many questions about 
biochar, particularly in terms of making sure 
that it is affordable and has positive effects. In 
some studies, the material has actually reduced 
yields. Part of the difficulty is that biochar can 
be produced from all kinds of biomass and at 
different temperatures and speeds, which leads 
to huge variation in the substance — and in 
results. “I always say we should not even use 
the singular for biochar,’ says Lehmann. 
“There are only biochars.” 


AMAZONIAN ROOTS 

Although it is just starting to catch on with 
farmers today, biochar has ancient roots. Hun- 
dreds to thousands of years ago, residents of 
the Amazon produced it by heating up organic 
matter to create rich, fertile soils called terra 
preta. But the practice was abandoned around 
the time that European nations invaded South 
America, and relatively few farmers elsewhere 
have routinely used biochar. Scientists first took 
a big interest in the material about a decade ago, 
when growing concerns over global warming 
led some to tout biochar as a way to store huge 
amounts of carbon underground. Hope for that 
application has faded somewhat, but soil scien- 
tists are now exploring its use in agriculture and 
remediating pollution. 

A particular focus has been explaining how 
biochar affects water movement through soils. 
Rebecca Barnes, a biogeochemist at Colorado 
College in Colorado Springs, and some of her 
colleagues tested that by adding biochar to dif- 
ferent materials’. In sand, through which water 
typically drains very quickly, biochar slowed the 
movement of moisture by an average of 92%. 
In clay-rich soil, which usually retains water, 
biochar sped up movement by more than 300%. 

The researchers suggest that the biochar 
alters how water moves through the interstitial 
space — the gaps between grains in the soil. 

“Clays tend to be flat grains and sand tends 
to be circular grains, but biochar is very amor- 
phous — and so it’s not only creating these crazy 
pathways through the biochar, but it’s also cre- 
ating crazy pathways in that interstitial space,” 
says Barnes. She and her colleagues suggest that 


these convoluted pathways help to slow down 
drainage in sand and speed it up in clays. 
Thatis significant, Barnes says, because even 
though clays can hold large amounts of water, 
that moisture has a hard time moving through 
the grains and reaching plant roots. Some stud- 
ies have shown that plants grow better in soil 


THE HOPE 18 THAT 
BIOCHAR COULD 
HELP FARMERS, 

PARTICULARLY IN 

AFRICA AND OTHER 

DEVELOPING REGIONS. 


with added biochar than in plain soils or those 
treated just with compost’. 

Researchers are also teasing apart how 
biochars influence microbial activity in soil. 
Microbes typically act as a community; for 
example, many pathogenic bacteria attack a 
plant’s roots only when they have sufficient 
numbers to overwhelm the host’s immune 
response. Caroline Masiello, a biogeochemist 
at Rice University in Houston, Texas, and her 
co-workers have found’ that biochar can inhibit 
this by binding to the signalling molecules that 
bacterial cells secrete to coordinate their activity. 

“They all think they’re alone, because the 
telephone wires have been cut,’ says Masiello. 
With further research, she says, it might be 
possible to fine-tune this function of biochar 
to reduce plant infections. 

Other researchers are exploring how biochars 
can cut emissions of nitrous oxide, a greenhouse 
gas, from agricultural fields. Last year, Xiaoyu 
Liu, a soil scientist at Nanjing Agricultural Uni- 
versity in China, and his colleagues reported* 
that after biochar had been applied to maize 
(corn) and wheat fields once, nitrous oxide 
emissions declined over the following five crop 
seasons, a period of three years. Other studies 
have shown reductions as well, but research- 
ers have not yet been able to determine what 
exactly causes this effect. Applying biochar 
“can also improve some soil properties, like it 
can increase the potassium availability, and the 
soil organic-matter content’, says Liu, who has 
obtained some funding from biochar producers. 

But not all studies show biochar to be a won- 
der material. In some cases it has reduced crop 
yields’, and one study* suggests that it lowers 
the activity of plant genes that help to defend 
against insect and pathogen attacks. 

Lehmann says that this may come down to 
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improper applications of biochar. In some of the 
studies that showed decreases in yields, he says, 
the soils were perfectly fine to start with. Other 
work suggests’ that using the wrong type of bio- 
char can negatively impact the soil’s microbiota 
or, potentially, its carbon-storage capacity. A 
biochar made from rice straw, for example, will 
function differently in a certain soil than will 
biochar made from wood or manure. 

Overall, however, the positive impacts of bio- 
char seem to outweigh the negative ones. A 2011 
meta-analysis® found an overall average yield 
increase of 10%, rising to 14% in acidic soils. 
Biochar’s greatest potential might be in places 
where soils are degraded and fertilizer scarce, 
in part because it helps the soil to better retain 
any nutrients that it does have. Andrew Crane- 
Droesch at the University of California, Berke- 
ley, has been studying the impacts of biochar in 
such degraded soils in western Kenya. His pre- 
liminary data suggest that farms using biochar 
averaged 32% higher yields than controls. 

In June, a World Bank report? said that 
biochar probably holds the most potential for 
small farmers in developing countries, not just 
because they are working with the soils most 
likely to benefit, but because biochar may be 
a key element of ‘climate-smart’ agriculture 
— practices that both help to mitigate climate 
change and reduce vulnerability to its effects. 


POLLUTION WRANGLER 

Biochar’s start may have been in agriculture, 
but researchers are now looking at other appli- 
cations. Biochar can bind to heavy metals in soil, 
which helps to keep them from reaching plants 
or entering water supplies. That has attracted 
the notice of the US Environmental Protection 
Agency, other agencies, and companies seek- 
ing to reclaim land formerly used in mining. At 
the Hope Mine near Aspen, Colorado, biochar 
added in 2010 helped to neutralize the impacts 
of decades-old mine refuse by immobilizing the 
metals and increasing the amount of water held 
on the slope — thereby reducing the opportu- 
nity for contaminated water to become run-off. 
It also helped to spur plant growth on the for- 
merly barren hillside, according to the Aspen 
Center for Environmental Studies. 

Biochar is also showing promise in cleaning 
up polluted water, perhaps as a much cheaper 
replacement for activated charcoal, which is 
used at sites ranging from treatment plants to 
areas that are heavily contaminated with toxic 
chemicals. Biochar particles have a relatively 
large surface area, which expands even further 
in water, providing a vast number of sites for 
contaminants to bind to, says Charles Pittman, 
a retired chemist at Mississippi State University 
in Starkville. He says that this type of pollution 
remediation may be particularly beneficial in 
countries that lack full water-treatment sys- 
tems. It could also help to remove antibiotics or 
chemical wastes, which are difficult to strip out 
with conventional water treatments. 

Scientists have even explored biochar’s 
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Workers at the Villa Carmen Biological Station in Peru turn soil containing black flecks of biochar, produced by burning bamboo in metal drums. 


potential for treating fluids used in oil and 
gas drilling, and as a component of print ton- 
ers and paint products. “There’s a lot of other 
markets that haven't fully been explored yet,’ 
says Kurt Spokas, a soil scientist with the US 
Department of Agriculture’s Agricultural 
Research Service in St Paul, Minnesota. 

Experts caution, however, that it is not clear 
when or whether remediation — or other appli- 
cations — will be economical, particularly in 
agriculture. Poor soils and poverty often go 
together. After demonstrating yield increases in 
Kenya, Crane-Droesch looked at the economic 
viability of biochar in the same communities. 
“What we found was almost nobody was will- 
ing to pay for biochar when offered at roughly 
the price it took to make it,” he says. 

Biochar prices vary widely, but in the United 
States some products cost US$3 per kilogram, 
comparable to certain fertilizers and more than 
many composts. On a large scale, biochar pro- 
duction may make economic sense only when 
biofuel production does — for example if it is 
subsidized or because policies to reduce carbon 
emissions drive fossil-fuel prices up. 

And if demand ever does surge, there will 
be questions about the environmental impact 
of producing biochar. One key concern is the 
choice of feedstock. China is eager to use agri- 
cultural waste, such as rice and wheat straw, and 
some researchers in the United States are even 


pushing animal manure, but neither may be the 
most efficient way to produce it on a massive 
scale. And using wood could spur deforestation 
or harmful land-use practices. 

“It’s an incredibly important question to ask: 
what is the sustainability of the feedstock?” 
says Alfred Gathorne- Hardy, research director 
of the India Centre for Sustainable Develop- 
ment at the University of Oxford, UK. “This 
is the kind of debate I don't think we're seeing 
enough about within the biochar world” 


GROWTH INDUSTRY 
That debate may grow as consumer interest 
does — something that is slowly happen- 
ing around the world. Bjérn Embrén, who 
is responsible for tree planning and protec- 
tion in Stockholm, says that the city has been 
using biochar to boost local vegetation since 
2009; he credits it with the city’s healthiest tree 
growth in recent years. In September, the New 
York-based charity Bloomberg Philanthropies 
awarded Stockholm €1 million (US$1.2 mil- 
lion) to launch a city-wide programme that 
will turn residential garden waste, and eventu- 
ally food waste and even sewage, into biochar. 
Back in Brooklyn, Flanner continues to 
monitor his crops. The lettuces and carrot 
tops glisten under the rain as he steps care- 
fully between rows in his bright yellow rain 
jacket. He thinks that the biochar will be good 
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for his soils over the long term because it helps 
them to retain nutrients and water. “Those are 
both very important, especially in such a well- 
drained soil as on a green roof,’ he says. “We 
tend to lose both of those quickly” 

But before he adds more of the black grains 
to other parts of his farm, he will wait to see 
how the crops respond over the next few years. 
Like the scientists studying biochar, he is eager 
to see whether it will live up to its bright prom- 
ise or fade like so many other would-be won- 
der materials. = 


Rachel Cernansky is a freelance writer in 
Denver, Colorado. 
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HERE S LOOKING AT YOU, SQUID 


Margaret McFall- Ngai has dissected the relationship between a beautiful squid 
and its live-in bacteria — and found lessons for microbiome research on the way. 


BY ED YONG 


he aquarium looks empty, but there is something in it. A pair 

of eyes stick out from the sandy floor, and their owner is easily 

scooped up into a glass bowl. At first, the creature looks like a 

hazelnut truffle — small, round and covered in tiny flecks. But 
with a gentle shake, the flecks of sand fall off to reveal a female Hawaiian 
bobtail squid (Euprymna scolopes), about the size of a thumb. As she jets 
furiously around the bowl, discs of pigment bloom and fade over her 
skin like a living pointillist painting. 

There are no other animals in the bowl, but the squid is not alone. Its 
undersides contain a two-chambered light organ that is full of glowing 
bacteria called Vibrio fischeri. In the wild, their luminescence is thought 
to match the moonlight welling down from above and cancel out the 
squid’s shadow, hiding the animal from predators. From below, the 
squid is invisible. From above, it is adorable. “They're just so beautiful? 
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says Margaret McFall-Ngai, a zoologist at the University of Wisconsin- 
Madison. “They’re phenomenal lab animals.” 

Few things excite McFall-Ngai more than the partnership between the 
bobtail squid and V. fischeri — and that is after studying it for more than 
26 years. Over that time, she has shown that this symbiotic relationship 
is more intimate than anyone had imagined. She has found that the bac- 
terium out-competes other microbes to establish an entirely faithful rela- 
tionship with one host. It interacts with the squid’s immune system, guides 
its body clock and shapes its early development by transforming its body. 

Some of these discoveries have helped to shape her field. When 
McFall-Ngai started her career in 1978, microbiologists were focused 
almost entirely on pathogens and disease. But in the past decade, 
advances in genetic sequencing have allowed scientists to identify the 
trillions of microbes in the bodies of humans and other animals, and 
to show how they support development, digestion and even behaviour. 
The study of these communities, collectively known as the microbiome, 
is now one of the hottest areas in biology, and some of the discover- 
ies made by McFall-Ngai have paved the way. “She pioneered work on 
animal-microbe interactions well before everyone caught up and the 
microbiome became such a sexy topic,’ says Dianne Newman, a geobiol- 
ogist at the California Institute of Technology in Pasadena. 

The microbiome boom is both a blessing and a curse. Attention and 
funding has focused heavily on projects to sequence microbes en masse, 
particularly in the human body, and on efforts to understand how they 
affect health. The squid and its luminous partner risk being eclipsed, at 
atime when funding is increasingly tight. But even the most prominent 
microbiome researchers say that they have time for McFall-Ngai and her 
squid—bacteria symbiosis, because understanding this simple relation- 
ship could help to make sense of more complex microbial communities, 
which are, by their nature, harder to study. “Td argue that it’s important 
to take advantage of the lessons emerging from such systems,’ says Jeff 
Gordon at Washington University in St. Louis, one of the leading figures 
in human microbiome research. “Their importance isn't diminished.” 

The squid may represent the road less travelled — but McFall-Ngai 
has always been drawn by such paths. “When I first met her, we were 
both in LA, driving a lot,’ recalls her partner, Ned Ruby. “If she was 
driving from A to B, even if there was one obvious way, shed try all these 
routes. Most would be longer. I'd say, “Why are we doing this?’ Shed say, 
“You never know when the freeway’s going to be blocked. I want to scout 
out the ways of going round’ That’s how she does science. She doesn't 
go down the main road and get blocked. She goes down the side roads.” 


LIGHT THE WAY 

McFall-Ngai started down her scientific road as a graduate student, 
when she became fascinated by bioluminescence and started study- 
ing ponyfish, which carry a glowing bacterium. She wanted to under- 
stand how these partnerships began, but was frustrated because the 
fish proved impossible to raise in a lab. Then, a colleague said to her, 
“Hey, have you heard about this squid?” A few embryologists had been 
studying the creature, which swims in shallow reef flats around Hawaii 
and emerges at night to forage. But no one had paid attention to the 
relationship with its bacteria — until 1988, when McFall-Ngai flew out 
to Hawaii to take a look. 

First, she had to learn how to catch the animals; in knee-deep water, 
she could snag dozens with just torches and nets. She began breeding 
them in 1989, when she started her own lab at the University of Southern 
California in Los Angeles. She found that just 8-10 pairs could produce 
60,000 juveniles a year. And unlike animals whose symbionts provide 
essential nutrients, the squid can survive without V. fischeri. This meant 
that McFall-Ngai could raise the partners separately, introduce them, 
and watch their first dates. 

But first, she needed a collaborator — someone who understood the 
bacterium. “I think I was the third microbiologist she came to and the 
first who said yes,’ says Ruby. The two had met when they were taking 
courses in Los Angeles. They have been professional partners ever since 
she started working with the squid, and romantic ones for most of that 
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time. “I think it’s a real symbiosis the two of them have,” says Nicole 
Dubilier from the Max Planck Institute for Marine Microbiology in 
Bremen, Germany. 

McFall-Ngai and Ruby embarked on a journey to unpick every aspect 
of the squid—bacterium symbiosis, at first in separate institutions, then 
on adjacent floors at the University of Hawaii at Manoa in Honolulu, 
and finally in adjoining rooms at the University of Wisconsin-Madison. 
They knew that the squid are colonized by V. fischeri within hours of 
hatching. But how does the bacterium infiltrate the light organ? And 
why is it the only species to do so, when other ocean bacteria collectively 
outnumber it 1,000-fold? To find out, McFall-Ngai carefully dissected 
the light organ, and Ruby loaded the bacteria with fluorescent proteins 
to track the microbes’ movements. 

Some details of the symbiosis are still falling into place. But the pair now 
know that the relationship begins on the underside of the newborn squid, 
when mucus-lined fields of beating hairs called cilia create a current that 
draws bacteria close. Physics then gives way to chemistry. When V. fischeri 
first touches the squid, it changes the expression of scores of squid genes 
—a finding’ made in 2013 by former postdoc Natacha Kremer. Some 
of these genes produce a cocktail of antimicrobial chemicals that create 
an inhospitable environment for most microbes while leaving V. fischeri 
unharmed. Others release an enzyme that breaks down the squid’s mucus 
to produce chitobiose, a substance that attracts more of the bacterium. 
It takes just five V. fischeri cells to trigger these changes, and the microbe 
soon dominates the fields of cilia (see ‘What the squid hid’). 

Chitobiose also stimulates the bacteria to start migrating into three 
blind-ended crypts in the squid’s light organ. Once they reach their desti- 
nation, they cause the pillar-like cells that line the crypts to become bigger 
and denser, enveloping the microbes ina tight embrace’. The crypts close 
off, sealing V. fischeri inside for the rest of the squid’s 3-10-month life’. 

In 2004, McFall-Ngai’s team showed that two molecules carried by 
the bacteria — peptidoglycan and lipopolysaccharide — are responsible 
for these changes*. That was a surprise. At the time, these chemicals 
were known only in the context of disease — they were described as 
pathogen-associated molecular patterns, or PAMPs, tell-tale substances 
that alert animal immune systems to burgeoning infections. McFall- 
Ngai took the acronym, swapped the pathogenic P for a microbial M, 
and rebranded them as MAMPs. These molecules, she proposed, can 
trigger debilitating inflammation but they can also start a friendship: 
without them, the squid’s light organ never reaches its mature form. 

To McFall-Ngai, these results hinted at a broader theme in biology: 
animals grow up under the influence of their microbes, not just the 
blueprints encoded in their genomes. “Most of us would say: Isn't that 
interesting? Margaret said: That's interesting ... and microbes play a role 
in development,’ says Angela Douglas, an entomologist and microbiolo- 
gist at Cornell University in Ithaca, New York. “She doesn’t deal in little 
ideas.” McFall-Ngai proposed’ the concept in 1991, and other scientists 
have confirmed it, finding that the bodies and immune systems of ani- 
mals ranging from tsetse flies to mammals mature properly only after 
exposure to bacteria — sometimes in response to the same MAMPs. 

Michael Hadfield, a marine biologist at the University of Hawaii, for 
example, has shown that the larvae of some marine worms metamor- 
phose into adults only when they encounter bacterial molecules®. This 
made sense when he considered that the earliest animals originated in 
oceans that were swarming with bacteria. “They very likely evolved to 
‘use’ those bacteria as a source of cues for developmental change,’ he says. 

McFall-Ngai has championed other ideas, too. One of them emerged 
when she started thinking about the adaptive immune system, a trade- 
mark of vertebrates that targets incoming microbial threats with bespoke 
antibodies and retains a memory of past encounters. Invertebrates, 
including squid, rely on innate immunity — a simpler, short-lived and 
ever-present battalion of defensive cells. Many immunologists had 
assumed that vertebrates evolved adaptive immunity because they live 
longer than invertebrates, and a more complex immune system affords 
them better protection against pathogens across an extended lifespan. 

In 2007 — just as interest in microbiomes was taking off — McFall-Ngai 
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WHAT THE SQUID HID 


The interaction between Vibrio fischeri and the juvenile bobtail squid 
means that this is the only bacterium to colonize the squid's light organ 
within hours of hatching. 


Approximate 
actual size 2mm 


2: Squid creates 
inhospitable 

environment for 
all but V. fischeri. 


1: Beating cilia 
draw in a mix 
of bacteria. 


offered an alternative explanation. In a commentary for Nature called 
Care for the Community’, she argued that adaptive immunity evolved 
because vertebrates need to control a more complex microbiome than 
invertebrates do. They use it to support beneficial microbes and to block 
those that pose a threat. 

Not everyone buys into the hypothesis. Forest Rohwer, an immunolo- 
gist at San Diego State University in California, points out that corals 
lack adaptive immunity but host some of the most complicated micro- 
biomes around. Still, he agrees that adaptive immunity might allow 
vertebrates to fine-tune their large microbiomes, and other scientists 
concur. “It’s a different way of thinking about the immune system,” says 
Douglas. “People can agree or disagree with it, but it is a touchstone. If 
someone says, ‘Remember Care for the Community? everyone knows 
what they mean. It’s short for a suite of ideas that challenge traditional 
notions in a really informed way.” 


TALK ABOUT A REVOLUTION 

McFall-Ngai exudes a stateswoman’s confidence as well as a scientist’s 
exuberance; friends describe her as regal. And so convinced is she by the 
importance of animal-microbe interactions that her message can verge 
on evangelism. “We now know that microbes make up the vast diversity 
of the biosphere, and that animal biology was shaped by interacting with 
microbes,’ she says. “In my mind, this is the most significant revolution 
in biology since Darwin.” 

McFall-Ngai has broadcast that message widely. In 2005, when the 
American Society for Microbiology was dominated by infectious-disease 
researchers, she persuaded the organization to run its first meeting on 
beneficial microbes — a meeting that continues to be popular today. She 
served ona National Academy of Sciences committee convened by Presi- 
dent Barack Obama to outline where biology in the United States will go 
in the 21st century. In 2012, she helped Newman to create a course that 
would teach undergraduates the principles of biology using microbes as 
the starting point of every topic — and she regularly flew from Madison 
to Pasadena during her holidays to teach the class. 
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Her passion for the squid has also spawned an academic dynasty. 
Ruby and McFall-Ngai have now trained dozens of scientists, around 
16 of whom are still studying the same symbiosis, now as heads of their 
own labs. But the duo discourages rivalries. “I grew up watching fields 
that eat their young, and I didn't want that,” says McFall-Ngai. The pair 
invites postdocs who set up their own lab to claim an aspect of the 
symbiosis for themselves — and every year they host a symposium- 
cum-party affectionately called the Pow-Wow, at which everyone gets 
together to share their results and plans. “If someone else says, ‘I was 
going to do that too, they sit in a corner and talk about it,” says Ruby. 

Despite the conviviality, the group knows that it must compete for a 
limited pot of funds. “I’ve been told, “We've already funded Margaret 
or Ned; how many more can we fund?;” says Spencer Nyholm, an early 
student of McFall-Ngai who now works at the University of Connecticut 
in Storrs. “I can’t imagine they would ask this if someone proposed to 
work with Drosophila or C. elegans or mice? 

McFall-Ngai says that she and her protégés are just getting started. 
In one project, she is examining an evolutionary theory predicting that 
every microbiome should be plagued by cheats — microbes that reap the 
benefits of life in their hosts but do not provide anything in return. Sure 
enough, the squid is sometimes colonized by strains of V. fischeri that 
do not make any light. McFall-Ngai’s team has found that the squid can 
use light-sensitive proteins in its light organ to detect a few dark bacte- 
ria among a million brightly glowing ones, and selectively evict them®. 
The team now wants to find out more about how it does this — and the 
answers might help to explain how humans and other vertebrates man- 
age more complicated microbiomes. 

The team has also shown that the squid’s relationship with V. fischeri 
varies over the course of a day, controlling the microbes so that they pro- 
duce light only at night’. And in 2013, former student Elizabeth Heath- 
Heckman showed that V. fischeri, in turn, influences the squid’s daily 
rhythms through a gene that makes a cryptochrome — a type of protein 
that affects circadian rhythms in many animals'’. Cryptochromes are 
usually activated by environmental light, but Heath-Heckman showed 
that one of the squid’s cryptochrome genes responds only to the blue 
light that V. fischeri emit, ramping up production of the protein. 

On the basis of this work, the team predicted that interactions between 
people and their resident microbes might also change from day to night 
— and soon, the evidence was pointing that way. Last year, a group in 
Israel showed that a significant proportion of microbes in the human 
gut rise and fall in abundance in a 24-hour cycle", and regular jetlag, for 
instance, can promote weight gain by disrupting these rhythms. 

“One of the things we pound into people who come to the labs is that 
nobody really gives a damn about the squid,” says Ruby. “They care about 
the big questions that the squid will help to answer.’ To tackle more of 
those questions, in a few months McFall-Ngai and Ruby will move to share 
the squid’s home. They will return to Hawaii, where McFall-Ngai will head 
the Pacific Biosciences Research Center in Honolulu. It is a dream job, and 
a chance to indulge more in her favourite pastimes — skateboarding and 
bodysurfing — as well as watch the squid on moonlit nights. 

“This was completely backwater science,’ she says. “Now it’s front-seat 
science. It’s been fun to watch people realizing that microbes are the 
centre of the Universe, and to see the field blossom.” = 


Ed Yong is a science journalist based in London. 
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Flood victims wait to be airlifted from a home near the mouth of the Limpopo River in Mozambique last year. 


Manage climate-induced 
resettlement 


Governments need research and guidelines to help them to move towns and villages 
threatened by global warming, argue David L6pez-Carr and Jessica Marter-Kenyon. 


as climate change claims their homes. 
Having endured repeated flooding and 
erosion from sea-ice melting and perma- 
frost thaw, the 400 residents of Kivalina, an 
Alaskan village on a low barrier island in 
the Chukchi Sea, voted in 1998 and in 2000 
to relocate — together — to coastal sites on 
higher ground. More than a decade and a half 
later, Kivalina remains in limbo, its move sty- 
mied by institutional, financial and physical 
barriers’. 
No US federal or state agency has a man- 
date to undertake such mass resettlement, 


[Es people are watching in horror 


even though the government spent more than 
US$15 million on erosion control between 
2006 and 2009. Kivalina has failed to raise 
funds through climate lawsuits against oil and 
gas companies. And it has yet to identify suit- 
able relocation sites. Meanwhile, the village's 
water-supply and waste-storage systems have 
been damaged, and it could become uninhab- 
itable within a decade. 

Tens of thousands 

of people in more  Tohear more about 
than 85% of Alaska’s climate resettlement, 
213 native villages face visit: 

similar threats’. 


Papua New Guinea, China and Vietnam 
have already relocated communities that were 
vulnerable to flooding”. More than a dozen 
developing countries, including Uganda and 
Bhutan, have submitted national adapta- 
tion plans to the United Nations that involve 
population resettlement’. Sea-level rise this 
century threatens the cultural and national 
survival of several low-lying island nations 
in the Pacific and Indian oceans’. By 2050, 
climate-related hazards such as flooding, soil 
salinization, coastal erosion and droughts 
could displace hundreds of millions of people 
around the world from their homes, either 
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> temporarily or permanently’. 

In many cases, the best way to protect 
cultures, livelihoods and social links will 
be to move as a group. Yet population relo- 
cation is practically off the climate-policy 
radar. The United Nations Framework 
Convention on Climate Change (UNFCCC) 
did not officially recognize the need for such 
resettlement until 2010. And science has 
barely begun to examine the human and 
environmental drivers, costs and conse- 
quences. How severe must a threat — real 
or perceived — be for people to feel com- 
pelled to move? What determines whether 
they relocate as individuals or together? And 
how can the social, economic and psycho- 
logical downsides of population resettlement 
be minimized? 

Social and environmental scientists and 
policy-makers need to invest in research to 
better understand and manage such resettle- 
ment. Relocation must be incorporated into 
climate-adaptation policy discussions and 
funding initiatives in the run-up to the next 
UNFCCC Conference of the Parties (COP) 
in Paris in December. 


LAST RESORT? 

In a Thomson Reuters Web of Science 
search, we identified just 30 papers that 
examined climate-induced planned reloca- 
tions. Most concerned coastal communities 
that were subject to flooding, even though 
drought is a greater driver of mass displace- 
ment. More than half of the case studies were 
in Alaska or Mozambique. 

By contrast, dozens of papers are pub- 
lished each year on migration of individuals 
and families. But voluntary, managed reset- 
tlement of entire towns and villages may be 
a more effective way to sustain livelihoods 
and cultures in some cases. 

That said, relocating communities is 
fraught with difficulty. In the past 20 years, 
more than 300 million people have been 


resettled as a result of conservation, urbani- 
zation or development schemes, including 
dam and road-building, mainly in developing 
countries’. Most moves faced local resistance 
and were detrimental to livelihoods, health 
and well-being’*. Remuneration for lost 
income, land and jobs rarely compensated for 
reduced access to resources, fractured social 
networks and emotional trauma””. 

Climate-induced resettlement is viewed 
widely as a last resort**. Many citizens of the 
Pacific island of Tuvalu’ and of coastal north- 
ern Australia’, for instance, are loathe to relo- 
cate in the face of sea-level rise and mounting 
cyclone intensity. Many members of dis- 
placed communities return to their homes 
once the immediate threat has passed”*. A 
few months after Cyclone Eline hit Mozam- 
bique in 2000, thousands of evacuated flood 
victims moved back to the Limpopo River 
valley, pushed by a lack of jobs in the reloca- 
tion villages and pulled by memories of hav- 
ing coped with earlier floods’. 

What makes people such as the Kivalinans 
want to move away for good, and together? 
Migration becomes almost certain once 
resilience thresholds are crossed — such as 
insufficient rain for farmers to grow maize 
(corn) or the disappearance of an island 
below rising waters. But even the realization 
of an impending permanent threat may be 
enough. By the early 1990s, it had become 
evident to Kivalinans that their way of life 
would become unsustainable in their current 
location within decades. 

When trying to decide whether to migrate, 
sociocultural, political and economic con- 
cerns usually trump environmental pres- 
sures’. The reason most people cite for 
moving is not to avoid the effects of climate 
change per se, but to enhance their liveli- 
hoods and to remain with family and friends. 
We expect groups to make judgements along 
similar lines, while weighing the magnitude of 
the threat and the resilience of the population. 


When a landslide in Bududa, Uganda, in 2010 wiped out homes and families, the Ugandan government 
advised people to evacuate. 


266 | NATURE | VOL 517 | 15 JANUARY 2015 


© 2015 Macmillan Publishers Limited. All rights reserved 


But cultural and political dynamics could 
hold even greater sway in decisions related 
to group resettlement. Cultural systems and 
traditional livelihoods may be better pre- 
served through relocation than by staying 
put and defending an existing settlement. 
Powerful interests, either within or exter- 
nal to the group, can suppress or coerce 
individual and household mobility. 

Place and timing matter. Although some 
communities will stay until the water laps at 
their doors, others seem poised to proactively 
relocate. For example, the Pacific island 
nation of Kiribati and the city of Miami in 
Florida are both threatened by sea-level rise. 
But whereas Kiribati is considering perma- 
nent relocation, possibly to Fiji, the people 
of Miami intend to stay put and are investing 
hundreds of millions of dollars in shoring up 
sea walls and drainage systems (see ‘On the 
move’). Such disparities need to be under- 
stood if the global community is to manage 
relocation equitably. 

Careful planning is needed. Discussions 
about resettlement in low-lying island states 
such as the Maldives may seem premature 
because sea-level rise is unlikely to inundate 
these regions for several decades. But with 
their physical survival and cultural sover- 
eignty at risk, the stakes are high. Planning 
in advance can save time, money and lives. 


POLICY PRIORITIES 
Communication and negotiation are also 
important. The value that a community 
places on resettlement compared with other 
options such as sea-wall construction, tem- 
porary emergency evacuation and disaster 
relief need to be assessed so that appropri- 
ate policies can be chosen”. Coastal cities in 
the developed world might prefer to invest 
in flood barriers, for instance. Some vulner- 
able groups may favour household migration. 
Temporary measures may be enough when 
disaster impacts are short-lived or infrequent 
and emergency relief is available. 
Government interventions could fail if 
officials misinterpret or do not appreciate 
people’s perceptions of risk***. The majority 
of homes built to rehouse flood-threatened 
communities elsewhere in Mozambique 
after Cyclone Eline remained empty because 
planners underestimated locals’ resolve and 
confidence in dealing with floods, as well as 
farmers’ investment in and reliance on agri- 
culture in the flood plains of their home’. 
Institutional and legal systems remain 
ill prepared for managing relocation in 
response to climate threats. For example, the 
US Federal Emergency Management Agency, 
which provides federal aid for preparing for 
disasters and for relief and recovery after 
them, has little power to manage pre-emptive 
resettlement. It can support community relo- 
cation only once disaster strikes, and only in 
response to a handful of hazards — including 
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ON THE MOVE 


As climate-change impacts increase, 
thousands of communities will face 
decisions about whether to move or 
stay and how best to protect 
themselves against rising seas, 
melting ice, drought and flooding. 


drought and hurricanes, but not the severe 
erosion that threatens Arctic settlements’. 

Several international policy initiatives 
have formed recently to develop resettlement 
guidelines. For example, the Nansen Initia- 
tive led by the Norwegian and Swiss govern- 
ments is planning a global meeting this year 
to agree to a set of best practices for dealing 
with cross-border climate displacement. The 
Peninsula Principles on Climate Displace- 
ment within States, developed in 2013 bya 
group of scientific and legal experts, provides 
a similar framework for assisting affected 
people within national borders’. 

These efforts are a start, but they remain 
scant and underfunded and are years from 
application. 


RELOCATION AGENDA 

With the worst impacts of climate change yet 
to come, there is a window of opportunity. 
Relocation must be higher on the agenda for 
global climate research and policy; ways to 
manage it must champion human rights and 
improve livelihoods. We recommend begin- 
ning with the following steps. 


Expand research. A global survey of climate- 
induced resettlement and the lessons learned 
is imperative. Analyses of the drivers, con- 
sequences and spatial distributions of reset- 
tlement at many scales are crucial to crafting 
informed policy. The UN, World Bank and 
Group of 8 nations should convene inter- 
national working groups on climate reloca- 
tion. Sustained research funding should be 


{ flooding and storms. 


resilience to hazards such 
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| from Vietnam’s river 

)| deltas as sea levels rise. 


long used resettlement 
strategies for environmental 
hazards including landslides. 


In the Maldives, 


380,000 citizens must 
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islands disappear below 


rising seas this century. 


settled if the 


Australians would relocate if dé 
water becomes scarce but would ‘ 
stay if desalinated or recycled 
water adaptations are tried. 


provided by nations in amounts that are pro- 
portionate to their carbon emissions. 


Apply existing international law. Planning 
must involve the people who will be affected. 
Guidelines governing the protection of devel- 
opment- and disaster-displaced people, such 
as the Hyogo Framework for Action, the UN 
Principles on Housing and Property Restitu- 
tion for Refugees and Displaced Persons and 
the World Bank Guidelines on Involuntary 
Resettlement, must be applied universally~”. 
The global community must promote the 
goals of the Nansen and Peninsula initiatives: 
to avoid permanent relocation where possi- 


ble, and to safeguard the needs and rights of 


people who are displaced by climate change. 


Address constraints to resettlement. Coun- 
tries should examine whether their institu- 
tional and legal systems are flexible enough 
to respond to relocation threats — internally 
and internationally — swiftly, successfully 
and comprehensively. States will need to adapt 
laws on disaster declaration and recovery to 
cover a broader range of environmental haz- 
ards, identify funding sources and address 
climate change within policies on housing, 
migration and natural-disaster response’. 


Improve adaptation financing. Funding 
is often an obstacle, even when everyone 
agrees that relocation is the best option. And 
resettlement projects are expensive. Billions 
of dollars will be needed to support the relo- 
cation of people in low-income countries. 


The UN member states that are responsible 
for the bulk of carbon emissions should bear 
the greater burden of this cost. 


Mitigate climate change and minimize its 
impacts. Reducing carbon emissions today 
is more cost-effective and less painful than 
doing so tomorrow. 


In December, the world’s climate activ- 
ists and policy-makers will convene in Paris 
for the 21st COP for the UNFCCC. There, a 
global strategy for climate relocation should 
be developed — for identifying it, under- 
standing it and managing it, equitably. m 


David Lépez-Carr and Jessica Marter- 
Kenyon are in the Department of Geography, 
University of California, Santa Barbara, 
Santa Barbara, California 93117, USA. 
e-mail: jsmarterkenyon@umail.ucsb.edu 
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John Marburger was the longest-running head of the US Office of Science and Technology Policy. 


SCIENCE POLICY 


From Brookhaven 
to Bush 


Peter Gluckman finds US presidential science adviser 
John Marburger’s posthumous collection enigmatic. 


he complex interfaces between 
science, society and public policy 
have evolved considerably over 
recent decades. The late John Marburger, 
US President George W. Bush’s science 
adviser from 2001 to 2009, wasa transitional 
and somewhat controversial figure within 
that time. Across his career, the mood was 
moving from the unquestioned acceptance 
of public expenditure on science that had 
prevailed after the Second World War to 
greater public interest in the value and soci- 
etal implications of science — and greater 
political debate about the role and findings 
of science. 

In his stint as director of Brookhaven 
National Laboratory in Upton, New York, 
from 1998 to 2001, Marburger emerged 
as an insightful intermediary between the 
scientific community and society. Yet when 
US science needed those very skills during 
the Bush era, Marburger seemed strangely 
silent, at least in public. Nor does this post- 
humous collection of his writing, Science 


Policy Up Close, reveal 
his thinking. Its editor, 
Robert Crease, main- 
tains the silence on 
this apparent paradox. 

But the science- 
policy nexus has two 
different faces. Policy 
advice for science uses 
levers (such as tax 


, 


Science Policy Up 


Close credits for research 
JOHN H MARBURGER and development) to 
ill; EDITED BY ROBERT influence the national 
P CREASE : 3 

faves science and innova- 
Harvard University . 
Press: 2015. tion ecosystem, most 


often with economic 
development as a primary goal. This is dis- 
tinct from science advice seeking to ensure 
that public policy is broadly informed by 
the best available evidence. Generally, both 
functions are vested in the same office — 
in the United States, that is the Office of 
Science and Technology Policy (OSTP). 
But when Marburger held the combined 
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posts of presidential science adviser and 
OSTP director, evidence-informed policy 
formation was apparently not a priority; 
and in Science Policy Up Close, he does not 
clarify whether this was by his choice or the 
administration's directive. Given that the 
Bush era revealed how vulnerable science 
can be in the face of organized vested inter- 
ests such as the oil industry, this omission 
is frustrating. 

Yet Marburger, a physicist, recognized 
the importance of achieving social licence 
for technology. This is seen clearly in his 
work at Brookhaven. In 1997, the lab had 
discovered a radioactive leak from a stor- 
age tank. Marburger realized that it was 
crucial to involve the local community in 
discussions. He writes about how — thrown 
into crisis-management mode over the 
public’s perception of risk at the site — he 
developed a signature approach to helping 
stakeholders to see others’ views. Similarly, 
when asking scientists to be accountable for 
the tax dollars they spend, Marburger was 
perhaps ahead of his time. His definition of 
accountability largely excluded scientists’ 
broader contributions to society, but this 
issue was a theme during his years at the 
OSTP, and coincided with a governmental 
turn towards greater investment logic and 
applying evaluation metrics in managing 
public science. 

In the book, however, Marburger does not 
address a headline issue of the Bush years 
— the public shift from trust in science to 
concerns over the government’ treatment of 
scientific evidence. His only defence is that 
he chose not to “waste energy” dealing with 
“controversies that were not in his power to 
influence”. That he reveals nothing of his 
own views on issues such as climate change 
or stem-cell research leaves an uncomfort- 
able vacuum. 

Our only window into his thinking is his 
set of annual addresses to the American 
Association for the Advancement of Science 
(AAAS) Forum on Science and Technology 
Policy, made while he was science adviser 
and reproduced in the book. In these, Mar- 
burger revealed what he felt were the key 
science-policy issues: scientific-workforce 
development and scientific immigration in 
the wake of the terrorist attacks of 11 Sep- 
tember 2001; the relative place of funding 
for discovery science versus directed fund- 
ing; and approaches to prioritization when 
national budget appropriations are made. 

It was only in 2004 that he seemed to 
comment on the role of science advice in 
developing public policy, after the non- 
profit Union of Concerned Scientists (UCS) 
issued a statement on ‘Restoring Scientific 
Integrity to Federal Policy Making. Mar- 
burger’s AAAS address that year was a well- 
scripted riposte to accusations that the Bush 
administration was ‘anti-science; stating that 


BILL INGALLS/NASA/GETTY 


LES ENLUMINURES 


“President Bush believes policy should be 
made with the best and most complete 
information”. Although ‘informatio 
does not necessarily mean ‘evidence’ that 
speech remains historically significant as 
an early instance of the closer relation- 
ship between science, society and policy 
that we know today. (Chris Mooney’s 
2005 The Republican War on Science 
(Basic Books) offers a very different 
interpretation of the events surrounding 
the UCS declaration.) 

Marburger’s policy comfort zone was 
clearly the meticulous analysis of the sci- 
ence and innovation ecosystem to better 
inform the appropriations process. His 
call for a new “science of science policy” 
— defining the metrics for evaluating the 
inputs and outputs of a public science 
system — is an important legacy that 
has helped to embed the concept in the 
government vernacular. That powerful 
focus on appropriations might have been 
a strategic way to promote evidence- 
informed public policy more broadly; 
but Marburger does not make that claim. 
It is one thing to support the production 
of evidence, and quite another to help it 
to find its way to the corridors of policy. 
Perhaps Marburger’s contribution was 
in supporting the 
supply of scien- 


be: “Itis one thing 
tific knowledge, to support the 
without concern- d ( 
ing himself with ja uction of 
the more complex evid ence, and 
business of devel- ule another 
oping the govern- to help itto 
ment’s appetite for find its way to 
the corridors 


its systematic use. 

Science Policy 
Up Close leaves 
the impression that Marburger might 
have had more to say had he been able 
to finish the book himself. Only the 
concluding essay offers a hint of his 
thoughts about the broader role of sci- 
ence in public policy and what he per- 
ceived as science’s lack of privilege in the 
seat of US governance. 

More than a decade after the UCS 
declaration, the favoured tactic for 
dealing with ‘inconvenient truths’ is 
perhaps less often about discredit- 
ing the science, and more often about 
acknowledging the evidence and placing 
it among the many legitimate inputs into 
policy and decision-making. But there is 
some way to go: although the science- 
policy nexus is maturing and becom- 
ing more nuanced, the challenges and 
loneliness of intermediary roles such as 
Marburger’s remain. = 


of policy.” 


Peter Gluckman is chief science adviser 
to the prime minister of New Zealand. 


HISTORY OF CHEMISTRY 


BOOKS & ARTS | COMMENT | 


Words into gold 


Philip Ball finds much wrestling with ideas in 
alchemists’ scribbled-over texts. 


r | Mhe sixteenth-century 
physician and alchemist 
Paracelsus claimed, “Not 

even a dog killer can learn his trade 

from books, but only from experi- 
ence.” As later ‘experimental phi- 
losophers’ turned alchemy 
into chemistry, they 
retained this affectation: 
in the seventeenth cen- 
tury, Robert Boyle is said to have 
claimed that he had learnt “more 
from men, real experiments, and 
his laboratory ... than from books”. 

Such comments seem to imply 
that alchemy and the transitional 
discipline of ‘chymistry’ were all 
about bench-top graft, in contrast to the 
medieval tradition of seeking knowledge 
in the library. Yet in most paintings of 
alchemists at work in the sixteenth and 
seventeenth centuries, books are ostenta- 
tiously on show. Apparatus lies unheeded or 
broken while the alchemist pores over a text, 
papers sometimes cascading in comic pro- 
fusion from desk to floor. In these images, 
books matter very much indeed: they seem 
to be where the real secrets lie. 

This vexed relationship is examined 
in Books of Secrets, an exhibition at the 
Chemical Heritage Foundation (CHF) in 
Philadelphia, Pennsylvania. Juxtaposing 
fifteenth-century alchemical books and 
manuscripts recently acquired by the CHF 
with its extensive collection of paintings of 
alchemists at their labours, the exhibition 
explores this early literature of proto- 
science, and what it was for. 

Alchemical books varied significantly. 
Some were esoteric treatises, all cryptic 
diagrams and encoded instructions for 
conducting ‘rubificatio and other chemi- 
cal procedures. Others were cheaply printed 
or hastily copied compilations of miscella- 
neous recipes for dyes, soaps and medicines. 
Both were apt to be marketed as ‘books of 
secrets. The term seems to promise for- 
bidden, mystical insights, but could also 
simply mean tricks of 
the trade. 

The new acquisi- 
tions, originally part 
of the Bibliotheca 


Books of Secrets: 
Writing and 
Reading Alchemy 
Chemical Heritage 


: ; Foundation, 
Philosophica Her- Philadelphia 
metica in Amster- pennsylvania. 
dam, include both Until 4 September. 


This alchemical manual may have become 
soot-smeared over a furnace. 


handwritten and printed documents, some 
attributed (often spuriously) to famous 
alchemists including Raymond Lull and 
Petrus Bonus. They reveal the character and 
functions of the literary culture of nascent 
chemical science from the Renaissance to 
the early Enlightenment. 

The books were evidently well used. The 
pages of one fifteenth-century compila- 
tion of Italian and English manuscripts 
arrived covered in dirt — or perhaps soot, 
from being read over a smoky furnace. The 
CHF’s curator of rare books, James Voelkel, 
persuaded conservator Rebecca Smyrl to 
avoid cleaning the pages: the ‘dirt may be a 
remnant of experiments. “It could be some- 
thing someone was trying to turn into gold? 
says Smyrl. 

To peruse these books is to glimpse a 
lively dialogue between author and reader. 
Despite the volumes’ costliness, some have 
words or passages crossed out or altered. In 
one sixteenth-century handwritten work, 
comments are squeezed into every corner 
of the margins: it is as much lab notebook as 
reference source. 

On this evidence, the painters had it 
right, even if their depictions of alche- 
mists often owed more to convention than 
observation. This band of proto-scientists 
engaged intimately with the words on the 
page. The text was not sacred, but it was 
indispensable. m 


Philip Ball is a writer based in London. 
e-mail: p.ball@btinternet.com 
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Correspondence 


Giant tortoises hatch 
on Galapagos island 


For the first time in 150 years 
and after more than 50 years of 
conservation efforts, the number 
of saddleback giant tortoises 
(Chelonoidis ephippium) seems 
set to recover unaided on 

Pinzon Island in the Galapagos 
archipelago. Rats, early whalers 
and pirates almost wiped out 
these ancient creatures. 

We found ten tiny, newly 
hatched saddleback tortoises on 
the island early last month. There 
could be many more, because 
their size and camouflage makes 
them hard to spot. Our discovery 
indicates that the giant tortoise is 
once again able to reproduce on 
its own in the wild. 

The Galapagos National Park 

and its collaborators set up a 
programme to save the tortoise 
in the 1960s, when only about 
100 animals remained. This 
involved collecting eggs and 
raising hatchlings in captivity 
for 4-5 years to reach ‘rat-proof’ 
size, as well as drastic rat- 
eradication measures (see Nature 
497, 306-308; 2013). These 
strategies have now enabled the 
species to stabilize itself. 
Washington Tapia Aguilera 
Galapagos Conservancy, Santa 
Cruz, Galapagos, Ecuador. 
Jeffreys Malaga Galapagos 
National Park, San Cristobal, 
Galapagos, Ecuador. 
James P. Gibbs State University of 
New York College of Environmental 
Science and Forestry, Syracuse, USA. 
jpgibbs@esf.edu 


Natural history: save 
Italy’s museums 


You call attention to the crisis in 
Italy’s natural history museums 
— in funding, personnel and 
administration, as well as in 
their visibility, research and 
purpose (Nature 515, 311-312; 
2014). There is a way to prevent 
the long-term management of 
these scientific collections from 
deteriorating into an elitist hobby. 
The Italian education and 


heritage ministries should 
together facilitate a functional 
and administrative connection 
between the country’s smaller 
natural history museums. 

Such a ‘meta-museum’ could 
coordinate long-term goals and 
scientific activities, enabling 
facilities and budgetary and 
technical resources to be 
shared — as in Germany's 
Senckenberg Research Institute 
and Natural History Museum 
in Frankfurt and Leibniz 
Association in Berlin. The 
largest of the museums could 
stay as independent scientific 
institutions, similar to London’s 
Natural History Museum and 
the National Museum of Natural 
History in Paris. 

To strengthen their scientific 
influence, these museums must 
participate in survey work and 
field research that contributes to 
the discovery, conservation and 
promulgation of national and 
global biodiversity. 

Franco Andreone* Museo 
Regionale di Scienze Naturali, 
Turin, Italy. 
franco.andreone@regione. 
piemonte.it 

*On behalf of 28 correspondents (see 
go.nature.com/c9bcvt for full list). 


Natural history: first 
museologist’s legacy 


As the Italian philosopher and 
naturalist Ulisse Aldrovandilay 
illin November 1603, he dictated 
his last will and testament — a 
remarkable and inspiring 
manifesto of scientific museology. 
He bequeathed his monumental 
collections and writings to be 
held in public trust so that they 
would be maintained for future 
generations of scholars. Sadly, his 
collections are among those now 
languishing in disarray in Italy 
(see Nature 515, 311-312; 2014). 
Aldrovandi conceived the idea 
ofa natural history museum, 
the first of which was created as 
a public institution in Bologna 
in 1547. He introduced the 
concept ofa sample type for any 
fossil species, an idea that was 


expanded during his lifetime by 
Francesco Calzolari in Verona, 
Michele Mercati in Rome and 
Ferrante Imperato in Naples — 
underscoring Italy’s crucial role 
in the birth and development of 
natural history museums. 

We have carelessly ignored 
the will ofa great father of 
museology. It is time to make 
amends by spurring a renaissance 
of these museums in Italy. 
Marco Romano Sapienza 
University of Rome, Italy. 
Richard L. Cifelli Sam Noble 
Museum, Norman, Oklahoma, USA. 
Gian Battista Vai University of 
Bologna, Italy. 
rlc@ou.edu 


Use mentoring to fix 
science inequality 


We suggest that mentorship 
is particularly important for 
scientists from the developing 
world (see Nature 515, 453-454; 
2014). It can address the problem 
of science inequality while 
helping to resolve global issues. 
Academics in developing 
countries are rarely able to 
take advantage of cutting- 
edge knowledge from leading 
universities, more than 90% of 
which are in high-income nations 
(see go.nature.com/wffbfi). 
Expenditure on research and 
development is typically less than 
1% of gross domestic product — 
one-third of the amount spent 
by most developed nations (see 
go.nature.com/gio8pu). 
Scientists from wealthy 
countries could be encouraged 
to mentor young scientists from 
developing nations by including 
mentorship requirements in 
grant awards, for example. 
Also, allocating resettlement 
funding can help to counter 
the ‘brain drain’ of researchers 
leaving low-income countries. 
It encourages them to return to 
share experience and knowledge 
and build local capacity. 
Malgorzata Blicharska Swedish 
Biodiversity Centre, Uppsala, 
Sweden. 
Grzegorz Mikusinski Swedish 


University of Agricultural Sciences, 
Skinnskatteberg, Sweden. 
malgorzata. blicharska@slu.se 


IPBES responds on 
conflicts of interest 


Contrary to the impression given 
by Axel Hochkirch and colleagues 
(Nature 516, 170; 2014), the 
Intergovernmental Platform 

on Biodiversity and Ecosystem 
Services (IPBES) is expected to 
approve a policy on conflicts of 
interest for authors at this week’s 
third plenary in Bonn, Germany. 

The IPBES has rules and 
procedures for nominating and 
selecting experts from a wide 
variety of sectors to ensure 
credibility and transparency 
(see go.nature.com/zhlosy). 
Accordingly, these experts are 
nominated by governments and 
stakeholders and selected by our 
Multidisciplinary Expert Panel. 

Our interim conflict-of- 
interest policy is in operation 
for the pollinators assessment 
(completed declaration forms 
available from the secretariat 
on request). The scientists 
from agrochemical companies 
mentioned by Hochkirch et al. 
were selected in their capacity as 
independent scientists to provide 
objective input for the pollination 
report. Moreover, independent 
peer review of all IPBES texts 
guards against any bias. 

Reports by other global and 
regional assessment panels 
routinely include authors from 
commercial sectors — for 
example, the Intergovernmental 
Panel on Climate Change and 
the International Assessment 
of Agricultural Knowledge, 
Science and Technology for 
Development. The latter 
was even accused of being 
unbalanced when industrial 
scientists withdrew from the 
assessment process (see Nature 
451, 223-224; 2008). 

Anne Larigauderie* IPBES, 
Bonn, Germany. 

anne. larigauderie@ipbes.net 

*On behalf of 4 correspondents (see 
go.nature.com/swyese for full list). 
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OBITUARY 


Alexander Grothendieck 


(1928-2014) 


Mathematician who rebuilt algebraic geometry. 


lexander Grothendieck, who died 
A: 13 November, was considered by 

many to be the greatest mathemati- 
cian of the twentieth century. His unique 
skill was to burrow into an area so deeply that 
its inner patterns on the most abstract level 
revealed themselves, and solutions to old 
problems fell out in straightforward ways. 

Grothendieck was born in Berlin in 1928 
to a Russian Jewish father and a German 
Protestant mother. After being separated 
from his parents at the age of five, he was 
briefly reunited with them in France just 
before his father was interned and then trans- 
ported to Auschwitz, where he died. Around 
1942, Grothendieck arrived in the village of 
Le Chambon-sur-Lignon, a centre of resist- 
ance against the Nazis, where thousands of 
refugees were hidden. It was probably here, 
at the secondary school Collége Cévonol, that 
his fascination for mathematics began. 

In 1945, Grothendieck enrolled at the 
University of Montpellier. He completed 
his doctoral thesis on topological vector 
spaces at the University of Nancy in 1953, 
and spent a short time teaching in Brazil. 
His most revolutionary work happened 
between 1954 and 1970, mainly at the Insti- 
tute of Advanced Scientific Studies (THES) in 
a suburb of Paris. His strength and dedica- 
tion were legendary: throughout his 15 years 
in mainstream mathematics, he would work 
long hours in the unheated attic of his house 
seven days a week. He was awarded the 
Fields medal in 1966 for his work in alge- 
braic geometry. 

Algebraic geometry is the field that studies 
the solutions of sets of polynomial equations 
by looking at their geometric properties. 
For instance, a circle is the set of solutions of 
x°+y’=1, and in general such a set of points 
is called a variety. Traditionally, algebraic 
geometry was limited to polynomials with 
real or complex coefficients, but just before 
Grothendieck’s work, André Weil and Oscar 
Zariski had realized that it could be con- 
nected to number theory if you allowed the 
polynomials to have coefficients in a finite 
field. These are a type of number that are 
added like the hours on a clock — 7 hours 
after 9 oclock is not 16 oclock, but 4 oclock 
— and it creates a new discrete type of variety, 
one variant for each prime number p. 

But the proper foundations of this enlarged 
view were unclear, and this is where, inspired 
by the ideas of the French mathematician 
Jean-Pierre Serre, but generalizing them 


enormously, Grothendieck made his first 
hugely significant innovation. He proposed 
that a geometric object called a scheme was 
associated to any commutative ring — thatis, 
a set in which addition and multiplication are 
defined and multiplication is commutative, 
ax b=bxa. Before Grothendieck, mathema- 
ticians considered only the case in which the 
ring is the set of functions on the variety that 
are expressible as polynomials in the coordi- 
nates. In any geometry, local parts are glued 
together in some fashion to create global 
objects, and this worked for schemes too. 
An example might help in illustrating 
how novel this idea was. A simple ring can 
be generated if we make a ring from expres- 
sions a+ be, in which a and bare ordinary real 
numbers but ¢ is a variable with only ‘very 
small’ values, so small that we decide to set 
&’=0. The scheme corresponding to this ring 
consists of only one point, and that point is 
allowed to move the infinitesimal distance ¢ 
but no further. The possibility of manipulat- 
ing infinitesimals was one great success of 
schemes. But Grothendieck’s ideas also had 
important implications in number theory. 
The ring of all integers, for example, defines 
a scheme that connects finite fields to real 
numbers, a bridge between the discrete and 
classical worlds, having one point for each 
prime number and one for the classical world. 
Probably his best-known work was 


272 | NATURE | VOL 517 |15 JANUARY 2015 


© 2015 Macmillan Publishers Limited. All rights reserved 


his discovery of how all schemes have a 
topology. Topology had been thought to 
belong exclusively to real objects, such as 
spheres and other surfaces in space. But 
Grothendieck found not one but two ways 
to endow all schemes, even the discrete ones, 
with a topology, and especially with the 
fundamental invariant called cohomology. 
With a brilliant group of collaborators, he 
gained deep insight into theories of coho- 
mology, and established them as some of 
the most important tools in modern math- 
ematics. Owing to the many connections 
that schemes turned out to have to various 
mathematical disciplines, from algebraic 
geometry to number theory to topology, 
there can be no doubt that Grothendieck’s 
work recast the foundations of large parts of 
twenty-first-century mathematics. 

Grothendieck left the IHES in 1970 for 
reasons not entirely clear to anyone. He 
turned from maths to the problems of envi- 
ronmental protection, founding the activist 
group Survivre. With a breathtakingly naive 
spirit (that had served him well in math- 
ematics), he believed that this movement 
could change the world. When he saw that it 
was not succeeding, in 1973, he returned to 
maths, teaching at the University of Montpel- 
lier. Despite writing thousand-page treatises 
on yet-deeper structures connecting alge- 
bra and geometry (still unpublished), his 
research was only meagrely funded by the 
CNRS, France's main basic-research agency. 

Grothendieck could be very warm. Yet the 
nightmares of his childhood had made hima 
complex person. He remained on a Nansen 
passport his whole life — a document issued 
for stateless people and refugees who could 
not obtain travel documents from a national 
authority. For the last two decades of his life 
he broke off from the maths community, 
his wife, a later partner and even his chil- 
dren. He sought total solitude in the village 
of Lasserre in the foothills of the Pyrenees. 
Here he wrote remarkable self-analytical 
works on topics ranging from maths and 
philosophy to religion. m 


David Mumford is professor emeritus 
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Massachusetts, and at Brown University in 
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Boxed up and ready to go 


Flow-tank experiments and fluid-dynamics simulations refute the idea that water movements over the body of boxfishes 
are a stabilizing influence, instead showing that the fish’s shape amplifies destabilizing forces to improve manoeuvrability. 


STACY C. FARINA & ADAM P. SUMMERS 


curious denizen of reefs, the boxfish 
At among the corals, turning this 

way and thatas it feeds on small inver- 
tebrates. The fish has been used as the bio- 
mimetic model for a low-drag concept car, the 
Mercedes-Benz Bionic, but it does not seem to 
be hydrodynamically gifted, because its swim- 
ming is neither swift nor effortless. Writing in 
the Journal of the Royal Society Interface, Van 
Wassenbergh et al.’ measured the hydro- 
dynamic properties of the boxfish shape by 
using three-dimensional (3D) printed models 
and computer simulations of fluid flow. They 
found that, contrary to previous research, the 
shape of the boxfish’s body creates high drag 
and passively lends itself to destabilizing flow. 

The boxfish would be an ideal costume fora 
fancy-dress party, because it can be modelled 
with a painted cardboard container with holes 
for the head, arms and legs. Its stiff carapace is 
an external skeleton made of plate-like, fused 
scales with large keels, like the edges of a box, 
that run along the length of the body (Fig. 1). 
Most other fishes power their swimming by 
moving their muscular bodies and tail from 
side to side, whereas the inflexible boxfish 
waggles its pectoral and pelvic fins, aided by 
occasional steering from the tail. Its progress 
through the water is largely determined by the 
shape of its wrap-around armour. 

For swimming and flying animals, stabil- 
ity and manoeuvrability are opposing needs, 
with a gain in stability usually meaning a loss of 
manoeuvrability. Observations of swimming 
boxfishes have shown that they are highly sta- 
ble during straightforward swimming, only 
infrequently being pushed off course by ambi- 
ent flows and their own body movements”’. 
Data from models of boxfishes suggest drag 
coefficients less than one-fifth of that ofa cube 
moving through the water. These observations 
led to the boxfish being touted as a model for 
a low-drag, high-stability shape for a high- 
volume structure’. However, boxfishes are 
also extremely manoeuvrable, able to make 
180° turns in the length of their body’. This 
presents a paradox — how can the carapace of 
a boxfish provide stability without inhibiting 
manoeuvrability? 

A series of previous studies had suggested 
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Figure 1 | Boxfish instability. The external skeleton of boxfishes — the carapace — is made up of 

rigid, fused scales. The edges of this carapace are called keels. Previous research*”’ had suggested that 
water flow leads to vortices forming around the keels that stabilize the boxfishes’ movements. However, 
Van Wassenbergh et al.' now show that the effect of these stabilizing vortices is outweighed by the 
destabilizing forces generated by the boxy front of the boxfish carapace, and this overall instability is what 


gives the boxfish its remarkable manoeuvrability. 


that the boxfish carapace is self-stabilizing*’. 
The authors of these studies proposed that, 
when the fish is thrown off course by turbu- 
lence, vortices form around the keels, pushing 
the fish back into a forward-facing position, 
so the keels act like the stabilizing flights of a 
dart. This passive process would require no 
energy or neural input from the fish, allow- 
ing instantaneous and inexpensive stabiliza- 
tion. Using 3D models of carapaces in a flow 
tank, the researchers visualized the vortices 
responsible for this self-stabilization and con- 
sistently found vortices occurring in positions 
that would provide stabilizing forces. 
However, in the latest study, Van Wassen- 
bergh and colleagues quantified flow around 
the entire carapace, not just around the 
keels, and found that the overall shape of the 
boxfish is actually destabilizing. The authors 
scanned the surface of the carapaces of two 
boxfish species to create 3D models, then 
used computational fluid dynamics to analyse 
the flows around each digitized shape. Drag 
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measurements are fraught with pitfalls, but 
these physical and computational models put 
the boxfish drag coefficient at twice the previ- 
ous values, which seems more likely. But the 
really interesting results addressed the paradox 
ofa stabilized yet manoeuvrable fish. 

On the basis of the shape of the carapace 
alone, a boxfish thrown off course by the cur- 
rent, or by its own fin movements, would tend 
to continue turning in the direction in which 
it had been pushed. Van Wassenbergh and col- 
leagues visualized vortices trailing from the 
keels of the carapace and also observed that 
these vortices produce stabilizing forces — just 
as the previous studies had shown. However, 
these forces were not nearly strong enough to 
overcome the larger destabilizing forces at the 
front of the carapace (Fig. 1). These destabi- 
lizing forces lead to great manoeuvrability, so 
the boxfish gains passive amplification of its 
movements from its shape. The authors cor- 
roborated this finding by printing their 3D 
models and measuring the forces acting on 


them in a flow tank under a variety of flow 
conditions. They again found that the shape 
of the carapace amplified the movements of the 
boxfishes, rather than stabilizing them. 

So it seems that, far from being darts gliding 
across the reef in a stable manner, boxfishes 
are tumblers, able to exploit small asymmetri- 
cal force inputs at the front of the carapace to 
generate large changes in direction. This raises 
an entirely different possibility for biomimetic 
applications, because the most manoeuvra- 
ble, low-radar-signature fighter jets, such as 
the F-117 Nighthawk, are also dynamically 
unstable’®. 

If the boxfish carapace has high drag and 
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Mixing it up 


is unstable, how was Mercedes-Benz able to 
model a low-drag car inspired by its shape? 
The answer lies in the nose of the car, which 
is rounded and so does not reflect the boxy 
front of the boxfishes. The front of the cara- 
pace amplifies the upsetting force, whereas the 
boxfishes’ keels are stabilizing. By retaining the 
keels but omitting the boxy head, the car com- 
bines stability with low drag. m 
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in the mantle 


Analysis reveals that the uranium isotopic composition of oceanic crust that is 
being subducted into Earth’s interior is distinctive, allowing the development of 
chemical heterogeneity in the mantle to be tracked. SEE LETTER P.356 


JON WOODHEAD 


researchers first proposed’ the radical 

hypothesis that oceanic crust, returned 
to the mantle (or subducted) during colli- 
sions between tectonic plates, could strongly 
influence the chemistry of Earth’s interior, and 
furthermore, that the tell-tale signatures of this 
process could be seen in the volcanic products 
of mantle melting. In particular, the chemi- 
cal traces of such ‘crustal recycling’ (Fig. 1) 
phenomena could be discerned in rocks 
termed ocean island basalts (OIBs) that are 
associated with volcanic ‘hotspots’ such as 
Hawaii. Variations on this simple yet provoca- 
tive idea have provided a focal point for studies 
of mantle geochemistry and planetary evolu- 
tion ever since. However, despite a substantial 
research effort and considerable advances in 
our understanding, definitive estimates of the 
timing of crustal-material transport into the 
mantle have remained elusive. In this issue, 
Andersen et al. (page 356) report on howa 
relatively new approach, using isotope ratios 
of the element uranium, provides some long- 
awaited temporal constraints on these crustal- 
recycling processes. 

It is generally accepted that recycled materials 
have a key role in the generation of compo- 
sitional heterogeneity in Earth’s mantle’, and 
indeed, evidence to this effect continues to 
appear’. By contrast, the question of when 
the mantle became modified in this way has 
proved remarkably intractable. For example, 


lE is now more than three decades since 


the abundances of the isotopes of lead (Pb) 
— derived from slow decay of long-lived 
uranium (U) and thorium (Th) parent nuclei 
— in OIBs form linear correlations that suggest 
a broad, model-dependent age range (about 
2.5 billion to 1 billion years) for the establish- 
ment of isotopic heterogeneity in their mantle 
source’. Another temporal constraint is pro- 
vided by the unusually low abundance ratios 
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of thorium to uranium (Th/U) observed in 
basaltic lavas erupted at Earth’s ocean ridges, 
known as mid-ocean-ridge basalts (MORBs). 
These ratios are lower than those estimated 
for the bulk Earth and have been explained® 
as resulting from uranium recycling into the 
mantle at subduction zones, perhaps starting 
about 2.4 billion years ago, coincident with 
the rise of atmospheric oxygen (and hence the 
availability of water-soluble hexavalent ura- 
nium, U(v1)). Beyond these few, rather impre- 
cise estimates, we have scant information on 
the timescales of crustal-recycling phenomena. 

Isotope geochemistry has always been an 
instrument-intensive discipline, quick to 
embrace new opportunities provided by tech- 
nological advances. The introduction of mass 
spectrometers known as multiple-collector 
inductively coupled plasma mass spectro- 
meters (MCICPMS) over the past 20 years 
has allowed detailed investigations of isotopic 
systems previously beyond our analytical 
capability, resulting in many breakthroughs. 
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Figure 1 | Crustal recycling. Oceanic crust (brown) is ‘recycled’ into Earth’s mantle at convergent 

plate boundaries (subduction zones). Over time, this crustal-recycling process has formed a chemically 
heterogeneous mantle mixture. Andersen and colleagues’ results’ place constraints on the timing of these 
events. They suggest that the upper-mantle source producing mid-ocean-ridge basalts (MORBs; short 
brown arrow) was contaminated in this way over the past 0.6 billion years, whereas heterogeneity in the 
deeper-mantle source producing ocean island basalts (OIBs; long brown arrows) probably resulted from a 
much older period of contamination between 0.6 billion and 2.5 billion years ago. 
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One of these has been the discovery that the 
abundance ratio of the uranium isotopes **U 
to?*U ?*U/?*U), long held to be invariant in 
nature, shows small variations”*. In their study, 
Andersen et al. used MCICPMS instrumenta- 
tion to explore the ramifications of this new 
paradigm for Earth’s uranium-isotope cycle. 
They detected subtle variations in **U/*"°'U 
in a range of geological samples, including 
OIBs and MORBs, and demonstrated that the 
present-day oceanic crust being subducted 
into the mantle is isotopically distinct from 
the bulk Earth, with high 38U PU values. 
Furthermore, they showed that this feature 
probably results from the emergence of fully 
oxygenated oceans 0.6 billion years ago. 

Importantly, the authors then observed 
that the mantle sources of MORBs and OIBs 
responded differently to subduction of this 
isotopically distinct crust. Whereas the shal- 
lower MORB source also shows high *U/?*U 
values, suggesting widespread pollution of its 
mantle source by subducted crust predomi- 
nantly in the past 0.6 billion years, the deeper 
OIB reservoir shows no sign of this effect. 
Andersen and colleagues propose that this 
reflects the greater antiquity of the recycling 
events contributing to the OIB reservoir. When 
coupled with previous observations of Th/U, 
these new data suggest ages for the deep OIB 
reservoirs of between 2.5 billion years (derived 
from model ages based on the abundance of 
Pb isotopes) and 0.6 billion years (when the 
oceans became fully oxygenated). 

As exciting as these results are, we must 
realize that the range of **U/’*'U variations 
observed is at the limit of what current instru- 
mentation can detect. In addition, there are 
currently no appropriate reference materials 
characterized to this level of precision. For 
this reason, further detailed studies will be 
required to confirm these remarkable observa- 
tions. Moreover, in an attempt to characterize 
‘average’ oceanic crust, the authors used com- 
posite samples of oceanic crust, derived from 
Ocean Drilling Program Site 801 in the western 
Pacific. It is well known that the oceanic crust 
is highly heterogeneous and thus a crucial goal 
for future studies will be to broaden the ura- 
nium-isotope database to determine whether 
the crustal composite from Site 801 is truly rep- 
resentative of subducting oceanic crust. 

It is also interesting that recent in situ analyses 
of sulfur isotopes in tiny olivine-hosted sulfide 
inclusions in OIB lavas from Polynesia’ have 
revealed isotopic compositions that could have 
been generated on the early Earth by photo- 
chemical reactions only before about 2.45 bil- 
lion years ago, providing a lower age limit for 
the mantle source of these particular lavas. At 
first sight, these results seem inconsistent with 
those of Andersen et al., but we need to bear 
in mind that both studies encompass only a 
small part of the diverse OIB compositional 
spectrum. Future investigations exploiting 
these emerging geochemical tools will need 


to examine the many other OIB flavours: only 
then may we finally unlock the secrets of crustal 
recycling. = 
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African dawn 


The African Genome Variation Project presents genotyping and whole-genome 
data from individuals across sub-Saharan Africa, giving insight into population 
history and guiding future genomic studies on the continent. SEE ARTICLE P.327 


RAJ RAMESAR 


he story of human origins and diversi- 

fication is the story of Africa, and there 

is a growing interest in this story being 
developed and told by Africans. The paper 
on the African Genome Variation Project 
by Gurdasani et al.', which is published on 
page 327 of this issue, is an excellent example 
of collaborative work by African and non- 
African researchers, and a timely addition to 
the trickle of human-genomic data aimed at 
systematically characterizing genetic diversity 
across Africa. The project presents genotype 
data from more than 1,481 individuals from 
18 ethnolinguistic groups in sub-Saharan 
Africa (Fig. 1), and whole-genome sequencing 
from 320 individuals representing 7 ethno- 
linguistic groups from 3 geographically dis- 
tinct regions — Ethiopia (northeast Africa), 
Uganda (east Africa) and southern Africa. The 
resource is the most comprehensive represen- 
tation of African diversity so far. 

Gurdasani and colleagues’ paper uses data 
from 2,864 individuals from 33 African and 
non-African populations to provide a bird’s- 
eye view of the dynamicity of the human 
genome and fine-tuned information on areas 
of genomic differentiation and fixation in cer- 
tain African populations. It also highlights the 
usefulness of this information for identify- 
ing genes related to disease susceptibility and 
resistance. The authors’ selection of popula- 
tions representative of the three major African 
ethnolinguistic groups (Niger-Congo, Nilo- 
Saharan and Afro-Asiatic) takes into account 
the fact that these groupings represent key 
points — source, nexus and/or destination 
— on the routes of early human migration, 
and thus reflect interesting patterns of genetic 
diversity and selection. 

A noteworthy example among the authors’ 
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findings is signatures of significant mixing 
between Eurasian and west African genomes 
suggesting that Eurasians migrated back 
into Africa in the period around 7,500 to 
10,500 years ago, following the original exodus 
of modern humans from Africa that occurred 
tens of thousands of years earlier. The authors 
also detect imprints of Khoe-San populations 
from southern Africa in modern west Afri- 
can genomes. Although others have reported 
the presence of genetic ‘tracks’ from Eurasian 
and Khoe-San populations in the genomes of 
modern east and southern Africans””, finding 
such traces in west African genomes is interest- 
ing because it is probable that this Khoe-San 


admixture represents 
The three oeaaa popula- 
major African A possible inter- 
ethnolinguistic é 
died pretation of these 
groups stu findings is that these 
rep resent Eurasian immigrants, 
key points on or indeed the Khoe- 
the routes of San, brought with 
early human them a ‘wanderlust’ 
migration. gene that was then 


integrated into other 
African groups and translated into the Bantu 
expansion — the series of migrations, occur- 
ring around 3,000-5,000 years ago, that spread 
the Niger-Congo ethnolinguistic group across 
much of sub-Saharan Africa. Large-scale 
whole-genome sequencing across Africa 
will provide many more insights into human 
diversity, evolution, population history and 
disease susceptibility, and emerging work on 
ancient African genomes will further help 
to resolve unattributed genetic diversity in 
African populations. 
Gurdasaniet al. also find that the Afro-Asiatic 
and Nilo-Saharan language groups make a 
larger contribution to African differentiation 
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Figure 1 | African genetic diversity. The African Genome Variation Project presents' genomic 
information from 18 ethnolinguistic groups across sub-Saharan Africa, representing populations from 
the west, east and south of the continent (“Ethiopia encompasses the Oromo, Amhara and Somali 
ethnolinguistic groups). Locations of spots are approximate. 


and diversity than previously estimated, 
suggesting that the Niger-Congo language 
groups (which represent the majority of the 
population across sub-Saharan Africa) were 
founded fairly recently by large numbers of 
individuals, coinciding with the Bantu expan- 
sion. The authors also observed a remarkable 
decrease in sub-Saharan population differen- 
tiation when Eurasian ancestry is artificially 
blocked using a computer program. This may 
at first seem surprising, but it could indicate a 
high level of within-group diversity resulting 
from the large migrations and founder popu- 
lations described above. Alternatively, it may 
represent back-migrations of relatively small 
bands of Eurasians who had been exposed to 
strong selective pressures and thus the reshap- 
ing of parts of their genomes. 

The African Genome Variation Project states 
that part of its vision is to facilitate medical 
genomic studies in Africa, through generating 
data on African populations and also through 
developing capacity for genomic research on 
the continent. Towards this goal, the authors 
present a comparison of existing genome chips 
— devices used for identifying genetic varia- 
tions that are associated with diseases — and 
suggest design modifications for chips that will 
better capture genomic variation in African 
populations. Such improvements in chip design 
will be of immediate benefit, especially to the 
large-scale genome projects covering different 
African populations that are already under way 
as part of the H3Africa Consortium’, 

Developing countries across the globe are 
at disparate stages of engaging with human- 
genomics research. In some parts of Asia 
and Latin America, there has been a ready 


engagement with genomics and other high- 
throughput technologies, as a result of the 
active involvement of academic institutions 
and governmental support. Such involvement 
is crucial if human genetics and genomics is 
to be integrated into education and training, 
and into other disciplines such as health and 
agriculture, and thus for this science to con- 
tribute to enhancing a state’s development. A 
good example of this integration is the coor- 
dinated effort of the 100,000 Genomes Project 
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in the United Kingdom with Health Education 
England, under the auspices of the UK 
National Health Service. This initiative inte- 
grates workforce training designed to raise 
awareness of the predicted impact of the ‘omics 
revolution on health care. 

Understandably, there has been substantial 
‘data-generation envy’ from countries and 
researchers on the African continent. 
Although adoption of the international 
genomics momentum is happening at dif- 
ferent speeds across the continent, large- 
scale genomic health projects such as the 
H3Africa Consortium, which is driven by the 
African Society of Human Genetics with fund- 
ing from the US National Institutes of Health 
and Britain’s Wellcome Trust, are serving as 
examples for networked research projects in 
Africa. However, effort is still needed to draw 
those researchers and countries not yet part of 
H3 Africa into this ‘genomic fest, and to ensure 
that government engagement, as well as edu- 
cation and training reforms, continue to grow 
beyond universities. m 
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Hydrophobic 
interactions in context 


The finding that immobilized ions can alter the strength of hydrophobic 
interactions between molecules suggests a strategy for tuning hydrophobicity to 
optimize molecular recognition and self-assembly processes. SEE LETTER P.347 


SHEKHAR GARDE 


iland water do not mix. At themolecular 
level, this ‘de-mixing’ tendency, 
known as the hydrophobic inter- 
action, is thought to drive many self-assembly 
processes, such as protein folding, formation 
of micelles and membranes, and molecular 
recognition’. On page 347 of this issue, Ma 
et al.” present experimental evidence that 
the strength of hydrophobic interactions is 


dramatically affected by two biologically rel- 
evant cations — ammonium and guanidin- 
ium — when these ions are immobilized near 
hydrophobic patches of molecules. 

The oily core of a globular protein is formed 
by segregation of its hydrophobic groups. By 
contrast, protein surfaces are mosaics contain- 
ing not only polar and charged groups, but also 
hydrophobic ones, which frequently occur in 
patches that are crucial for binding and rec- 
ognizing other molecules, and for assembling 
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Figure 1 | Context-dependent hydrophobic interactions. Ma et al.’ measured 
the hydrophobic interaction between the tip of an atomic force microscope 
(AFM) coated with a hydrophobic monolayer of molecules and various self- 
assembled monolayers of molecules on a gold surface in water. a, When amino 
(-NH,) groups were immobilized within nanometres of hydrophobic groups 


proteins into larger structures. These patches 
do not exist in isolation, but reside in the 
chemical and topographical context of their 
surroundings. There is a growing realization 
from simulation studies** that the surround- 
ing chemical environment affects hydrophobic 
interactions in non-intuitive ways, but quanti- 
fying this effect experimentally has remained 
a challenge. 

Hydrophobic interactions are affected by 
various factors, including temperature, pres- 
sure and the presence of various additives 
in solution. In particular, dissolved salt ions 
are free to move about and distribute them- 
selves, depleting from or binding to interfaces 
between dissolved molecules and the sur- 
rounding water, and thus affect the solubility 
and interactions of hydrophobic species in 
water®. But what happens when an ion is chem- 
ically immobilized near a hydrophobic patch? 

Ma and colleagues consider this question 
for ammonium and guanidinium ions, which 
are biologically relevant because they form 
the positively charged moieties of the amino 
acids lysine and arginine, respectively. The 
authors prepared self-assembled monolayers 
that present a mixture of hydrophobic groups 
(60%) and ionic species (either ammonium or 
guanidinium ions, 40%), which allowed them 
to position the ions within nanometres of 
hydrophobic regions. They then used atomic 
force microscopy (AFM) to measure the 
adhesive force between the resulting surfaces 
and a hydrophobic AFM tip in aqueous solu- 
tions (Fig. 1). 

To tease out the component of the force 
attributable to hydrophobic interactions, Ma 
and co-workers added methanol to the solu- 
tion, which eliminates most of the hydro- 
phobic interactions. The sensitivity of their 
experimental systems to methanol therefore 
acted as a signature of the hydrophobic com- 
ponent of the force. They also changed the pH 
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of the solution to toggle ions between their 
charged and neutral states. This was expected 
to affect electrostatic interactions, but the 
authors found that it also altered hydrophobic 
interactions. The investigators compared the 
behaviour of the mixed monolayers with those 
of reference systems, such as purely hydro- 
phobic or ionic monolayers, to draw their 
conclusions. 

The key result is that the strength of hydro- 
phobic interactions between the mixed mono- 
layer and the tip roughly doubles when neutral 
amine groups immobilized within nanometres 
of hydrophobic regions are charged to form 
ammonium ions by lowering the pH (Fig. 1a). 
By contrast, similarly placed guanidinium 
ions essentially eliminate the hydrophobic 
interactions at all pH values studied (Fig. 1b). 
Excitingly, these observations hold not only for 
interactions involving the extended hydropho- 
bic surfaces of self-assembled monolayers, but 
also for interactions between the hydrophobic 
AFM tip and hydrophobic patches on single- 
molecule helices of B-peptides that contain 
ammonium (lysine) or guanidinium (arginine) 
groups about 1 nm away from a patch. 

A major strength of the work is that it presents 
an experimental framework for characteriz- 
ing hydrophobic interactions that is versatile 
enough to probe how they are modulated by 
the proximity of other moieties. Although the 
authors focus on two cations here, this frame- 
work could be used to systematically study 
how a wide range of other groups and/or 
chemical patterns might modulate inherent 
hydrophobic interactions. 

Because hydrophobic interactions are medi- 
ated by water, a deeper discussion of their 
context dependence will require an under- 
standing of the role of water molecules. Previ- 
ously reported work” has shown that density 
fluctuations of water near a surface serve as 
one robust and context-dependent molecular 
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in a monolayer, and were then charged to form ammonium groups (-NH,’) 
by lowering the pH of the surrounding solution, the hydrophobic interaction 
doubled in strength. b, By contrast, when guanidinium groups (red, or their 
uncharged equivalents, guanidine groups, not shown) were incorporated into 
the monolayer, the hydrophobic interaction was essentially eliminated. 


measure of the hydrophobicity of the surface. 
Understanding how ammonium and guani- 
dinium ions influence water’s behaviour (such 
as its molecular packing, orientations and den- 
sity fluctuations close to surfaces) may shed 
light on the molecular origins of the current 
results. 

Ammonium and guanidinium ions inter- 
act with water molecules differently’. Ammo- 
nium is smaller than guanidinium and is well 
hydrated, and its salts are often used to stabilize 
or help to crystallize proteins. The guanidin- 
ium ion, with its central carbon atom attached 
to three amino (-NH,) groups, is a planar, disc- 
like molecule. It can donate hydrogen bonds 
to water molecules along its edge, but is easily 
depleted of water molecules along its flat faces®. 
This peculiar behaviour enables guanidinium 
ions to pair with other molecules, including 
with other guanidinium ions’, or with hydro- 
phobic surfaces along its flat face, and also 
to denature proteins (that is, to unravel their 
three-dimensional structures). What part 
this hydration behaviour of the ions plays in 
the effects observed by Ma and co-workers 
remains to be seen. 

The fact that ions in close proximity can 
dramatically enhance or diminish hydro- 
phobic interactions may provide an invalu- 
able handle for manipulating and designing 
protein-protein interactions. In this respect, 
the effects of negative ions — such as car- 
boxylates, phosphates or sulfates — could 
be just as interesting and relevant as those 
of the positively charged ions studied by Ma 
et al., because they are ubiquitous in naturally 
occurring compounds such as proteins, DNA 
and heparin (an anticoagulant). Indeed, when 
negatively charged ionic groups are inserted 
at the edge of a key hydrophobic binding loop 
of certain antibodies, the resulting molecules 
no longer stick to each other, but still bind to 
their targets’’: small aggregates or fibrils of 
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p-amyloid, the peptide that forms the main 
component of plaques found in the brains of 
patients with Alzheimer’s disease. 

More broadly, the fundamental questions 
about the molecular origins of the context 
dependence of hydrophobicity raised by Ma 
and colleagues’ work are ripe for investigation 
using theory simulations and experiments. 
Addressing these questions is important, 
because the findings reassert that the differ- 
ent factors involved when two proteins (or two 
chemically heterogeneous surfaces) interact 


with each other are not additive — throwing a 
spanner in the works of simplistic models that 
assume this. m 
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Dragonflies predict and 
plan their hunts 


An analysis reveals that the dragonfly’s impressive ability to catch its prey arises 
from internal calculations about its own movements and those of its target — the 
first example of such predictions in invertebrates. SEE ARTICLE P.333 


STACEY A. COMBES 


stage to meet his partner, who is leaping 

and pirouetting towards him. To catch her 
at the right moment, he must predict where 
she will end up and determine how he should 
move to intercept her. To do this, his mind 
anticipates how her image should grow as 
they move towards each other, allowing him 
to rapidly identify and react to unexpected 
changes, such as a stumble that lowers her 
speed. Until now, this type of complex con- 
trol, which incorporates both prediction and 
reaction, had been demonstrated only in 
vertebrates. However, in this issue, Mischiati 
et al.' (page 333) show that dragonflies on the 
hunt perform internal calculations every bit 
as complex as those of a ballet dancer. 

Dragonflies are formidable predators. With 
huge eyes that provide an almost spherical 
view of the world, they perch on vegetation, 
waiting for prey to drift overhead. When the 
time is right, they shoot off in pursuit, scoop- 
ing up victims with their hairy legs in less than 
half a second (Fig. 1). Dragonflies succeed in 
catching their prey about 95% of the time””, 
and this prowess has been attributed to their 
visual acuity and lightning-quick reflexes — 
in particular to the specialized visual neurons 
that detect the motion of a target and instruct 
the wings to react’. 

If dragonflies’ pursuits were guided purely 
by their reactions to the movements of their 
prey, one would predict a one-to-one mapping 
between prey manoeuvres and dragonfly reac- 
tions. Mischiati and colleagues show that this 
is clearly not the case. Dragonflies do respond 


I magine a ballet dancer moving across the 


to some prey manoeuvres, but more often they 
do not. And what’s more, the authors report 
that the majority of dragonfly manoeuvres are 
not associated with any change in prey motion. 

Some of these prey-independent manoeuvres 
are related to the mechanical requirements of 
prey capture: dragonflies align themselves with 
the flight path of their prey, approaching from 
below, most probably to reduce the likelihood 
of detection. Their bodies and heads move 
independently during prey capture, with the 
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head remaining locked onto its target” while 
the body manoeuvres into the optimal orienta- 
tion for capture. Until now, it had been assumed 
that these target-locking head motions were 
performed reactively, with dragonflies mov- 
ing their heads to re-centre the prey after any 
motion — either their own or that of their prey 
— that shifts the target from their sights. 

To tease apart the causes and consequences 
of head movements during prey capture, 
Mischiati et al. performed extremely accurate, 
high-speed measurements of prey position, 
and of dragonfly head and body orientation. 
Such measurements are possible only in a 
controlled, indoor setting, where dragonflies 
typically refuse to chase prey. To get around 
this problem, the authors constructed an 
indoor flight arena, complete with backdrops 
of natural scenery and lighting that simulated 
a bright, sunlit day. Once they had quantified 
the movements of dragonfly and prey, the 
researchers calculated how the image of the 
prey moved across the dragonfly’s eyes, as the 
result of the movements of both parties. These 
calculations revealed that the dragonfly’s 


Figure 1 | A dragonfly in flight. Mischiati et al.’ found that dragonflies on the hunt make internal 
calculations about the movements of their prey and themselves. 


15 JANUARY 


© 2015 Macmillan Publishers Limited. All rights reserved 


2015 | VOL 517 | NATURE | 279 


| RESEARCH | NEWS & VIEWS 


head motions are remarkably effective at 
cancelling out the large image drift across the 
eye that would have resulted from its own 
body rotations and the prey’s anticipated 
motion. Such cancellation ensures that the 
prey image remains within a few degrees of 
the dragonfly’s visual acute zone, in which its 
sight is at its sharpest. 

Most notably, these data show that, rather 
than adjusting head position after the prey 
image drifts outside the visual acute zone, 
dragonflies adjust their head positions in 
near-perfect synchrony with the motions that 
would cause image drift. This precise tim- 
ing led Mischiati and co-workers to surmise 
that dragonflies must be generating predic- 
tions using internal models of how prey- and 
self-motion will affect the location of the prey 
image on their eyes, and moving their heads to 
compensate before image drift occurs. 

This type of predictive control confers an 
advantage when compared to a purely reac- 
tive strategy. First, although the dragonfly’s 
response time is quite fast (approximately 
50 milliseconds), this still accounts for 10% to 
25% of a typical chase, so reacting only after 
each change in prey- or self-motion would 
extend the duration of a chase considerably. 
Second, because the dragonfly’s own body 
rotations cause much more image drift than 
the motion ofa distant prey item, nullifying 
this large image drift before it occurs means 
that the dragonfly’s visual system is more sensi- 
tive to unexpected prey manoeuvres, which it 
can then respond to reactively. 

Of course, there is a limit to how much 
laboratory studies can tell us about dragonfly 
predation in the wild. The current experiments 
used either slow, laboratory-reared fruit flies 
that rarely take evasive action’, or artificial 
prey undergoing a single change in speed. So, 
although Mischiati and colleagues’ results 
indicate that most manoeuvres relate to the 
dragonfly’s pre-choreographed capture strate- 
gies, in the wild, dragonflies must contend with 
prey that behave more unexpectedly. Many 
wild insects fly erratically at all times, or detect 
approaching predators and perform evasive 
manoeuvres. In these cases, reactive control 
is likely to dominate the dragonfly’s actions. 
Nonetheless, predictive steering strategies pre- 
sumably still underlie such more challenging 
pursuits. 

More broadly, Mischiati and colleagues’ 
results open up new avenues for exploring 
the mechanistic basis of complex behaviours 
involving both predictive and reactive con- 
trol. In situations such as those presented in 
this study’, the brain can align its internal pre- 
dictions with an appropriate reaction when 
reality deviates from expectations. These types 
of behaviour — particularly the use of ‘forward 
models, in which an animal predicts how its 
own actions will affect its sensory feedback 
— had previously been demonstrated only 
in vertebrates”, in which analysis of neural 


circuitry is challenging. By contrast, dragon- 
flies have accessible neural circuitry, and their 
relatively large size allows for measurements 
of behaviour and neural activity during free 
flight. Hunting dragonflies thus present a rare 
opportunity for conducting detailed, mecha- 
nistic studies of the neural circuits that under- 
lie complex behaviours. m 
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One catalyst, 
two reactions 


A catalyst has been tuned to make different compounds from the same molecules 
in carbon-nitrogen bond-forming reactions, depending on the conditions used. 
The products are potential building blocks for biologically active molecules. 


EMMANUELLE SCHULZ 


he ability to readily synthesize struc- 

turally complex molecules containing 

nitrogen atoms is crucial for organic 
chemists because such compounds have wide- 
spread applications, for example as drugs. But 
the nitrogen atoms must be incorporated into 
molecules at particular locations with respect 
to other atoms, using methods that are com- 
patible with chemical groups already present 
in those molecules. The three-dimensional 


arrangement of atoms must also be mastered 
to prepare ‘enantiopure’ compounds (single 
mirror-image isomers of compounds, called 
enantiomers), rather than a one-to-one 
mixture of enantiomers that must then be tedi- 
ously separated. This is crucial for medicinal 
chemists, because different enantiomers can 
have different, sometimes even opposing, 
biological activities. Writing in Nature Chem- 
istry, Shi and Buchwald’ report a variant of 
a carbon-nitrogen bond-forming reaction 
that solves many of the problems associated 
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Figure 1 | Selective hydroamination reactions of alkynes. Shi and Buchwald' report that a vinyl copper 
intermediate forms when an alkyne reacts with an electrophilic amine in the presence of copper(II) 
acetate, a ligand molecule (which binds to the copper ions) and a silane (HSiCH,(OC,H:),). a, In the 
absence of an alcohol, the intermediate reacts with the electrophilic amine to form an enamine. The 
carbon-nitrogen bond formed during the reaction is shown in red. b, But in the presence of an alcohol, 
an alkyl copper intermediate forms through a cascade of reactions, and produces amine products in 
enantiopure form (as single mirror-image isomers). R' to R* represent general chemical groups; Bz isa 


benzoyl group. 
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with making organic nitrogen-containing 
compounds. 

There are many ways of synthesizing 
carbon-nitrogen (C-N) bonds. Some are 
based on substitution reactions, in which an 
atom or group of atoms is displaced to form 
the desired C-N connection. Such reactions 
typically provide high product yields but gen- 
erate huge quantities of chemical waste, which 
is unsustainable given the need for environ- 
mentally friendly, ‘green’ chemistry’. Addition 
reactions, in which all the atoms present in an 
incoming molecule are incorporated into the 
target compound, are therefore preferred. 

The hydroamination reaction’ perfectly 
illustrates this second type of transformation. 
Its most direct form involves the addition of a 
nitrogen compound to carbon-carbon (C-C) 
double or triple bonds, generating both a 
C-N and a carbon-hydrogen (C-H) connec- 
tion without any by-products (Fig. 1). But the 
reaction is not straightforward, mainly because 
electrons in the two reactants repel each other. 
It is therefore crucial to use a catalyst to couple 
the reactants together. 

Because hydroamination has been so widely 
studied, numerous catalysts based on virtu- 
ally all the elements of the periodic table have 
been reported for this reaction. It is possible 
to obtain products with high enantiopurity, 
but there is not yet a universal method that 
is applicable to all molecules*. The reactions 
frequently require catalysts based on precious 
and expensive metals, harsh conditions (such 
as high temperatures) and chemically activated 
reactants, or they may be sensitive to factors 
such as air or moisture. They therefore do not 
meet the requirements of modern synthesis. 

Shi and Buchwald report a straightforward 
solution to these difficulties. They used a 
simple copper salt as a catalyst to perform 
hydroamination reactions between readily 
available alkynes (compounds with a C-C 
triple bond) and nitrogen-containing com- 
pounds known as electrophilic amines that 
have a specific affinity for the electronic char- 
acter of alkyne triple bonds. Starting from 
the same reactants, the authors found that by 
changing the reaction conditions, they could 
easily and preferentially prepare either one of 
two compounds — an enamine or an amine — 
possessing a highly desirable C-N bond. 

By combining an alkyne, an electrophilic 
amine, a copper salt, a ligand (an enantiopure 
phosphorus-containing molecule that binds to 
the reactive copper centre of the catalyst) and 
a silane compound (which provides hydro- 
gen atoms needed for the C-H bond forma- 
tion that occurs during the reaction), Shi and 
Buchwald observed the exclusive formation of 
enamines through an intermediate known as 
a vinyl copper species (Fig. 1). But when the 
authors simply added an alcohol (a source of 
hydrogen ions, H") to the mixture, no enam- 
ine formed; instead, the vinyl copper species 
underwent a cascade of reactions, ultimately 


and exclusively forming an alkyl copper 
intermediate. They then used this to prepare 
amines through the formation of new C-N 
bonds, as had previously been reported simul- 
taneously by Buchwald’s group” and by others®. 
Crucially, the isolated products are almost 
completely enantiopure. 

Shiand Buchwald have achieved a remarkable 
feat: by taming the reactivity of all the inter- 
mediate species, they performed reactions 
chemoselectively (forming either an enam- 
ine or an amine), regioselectively (grafting 
the nitrogen atom only to a targeted carbon 
atom) and, in the case of the amine forma- 
tion, enantioselectively. The authors went on 
to show that the reactions are applicable toa 
broad range of substrates, and that they work 
under ‘mild’ reaction conditions that toler- 
ate the presence of a wide range of chemical 
groups. Finally, the authors demonstrated the 
utility of their reactions by using them to trans- 
form a variety of easily prepared compounds 
into marketed pharmaceutical compounds — 
in one case, in nearly enantiopure form. 

This work represents a big step forward for 
the highly desirable hydroamination reaction. 
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Its strength lies in the variety of products that 
can be obtained by a simple change in the 
reaction conditions. Nevertheless, there is 
still room for improvement. In particular, the 
chemical groups that decorate the starting 
alkyne can affect the progress and outcome of 
the reaction. The search for universal catalysts 
is not yet over, but Shi and Buchwald’s discov- 
ery, based on the use of cheap and non-toxic 
copper salts, will doubtless inspire further 
progress. m 
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Seeking ultimate 


victory 


HIV variants that have mutated to escape T-cell immune responses dominate the 
latent viral reservoir in most patients on antiretroviral therapy. This finding will 
need to guide therapeutic approaches targeting reactivated virus. SEE LETTER P.381 


LOUIS J. PICKER & JEFFREY D. LIFSON 


Tzu advises would-be victorious generals 

to “know thy enemy”. Perhaps nowhere in 
medicine is this admonition more apt than 
in the effort to cure HIV infection. Although 
both physicians and patients would enthu- 
siastically welcome a simple, virus-directed 
therapy that could safely and efficiently purge 
HIV from the body, the biological obstacles to 
HIV cure — either viral eradication or com- 
plete, sustained, off-treatment remission — are 
too many and too complex to permit such a 
straightforward approach”’. A paper by Deng 
et al.* on page 381 of this issue contributes to 
the effort to defeat HIV by rigorously docu- 
menting one of these obstacles to cure, while 
at the same time pointing to a possible strategy 
to overcome this particular enemy strength. 

Although combination antiretroviral therapy 
(cART) is highly effective at suppressing active 
HIV replication, these drugs do not target the 
reservoirs of latent (non-replicating) HIV 


[I the ancient treatise The Art of War’, Sun 


that are established in infected CD4* memory 
T cells (a type of immune cell) early in infection 
and maintained over the lifetime of an indivi- 
dual?”. These latently infected cells, which do 
not express viral proteins and are thus invisible 
to surveillance by the host’s immune system, 
remain a source of virus that can reignite pro- 
gressive infection when cART is stopped. 
Drugs have been identified that can induce 
viral expression from latently infected cells 
in vitro, and possibly in vivo, and although 
overcoming latency remains a challenge, it 
looks increasingly likely that such agents will 
be improved to the point at which viral reac- 
tivation can be substantially accelerated*®. 
However, contrary to initial hopes, cells with 
reactivated virus do not invariably die as a result 
of cell-lytic viral replication, but instead may 
survive and possibly even proliferate’, indicat- 
ing that eliminating such cells is an important 
therapeutic goal. The obvious solution to this 
problem is to co-opt the host immune system, 
particularly the ability of CD8* T cells (also 
called cytotoxic T lymphocytes, or CTLs) to 
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Figure 1 | Immune escape complicates targeting of the HIV reservoir. 

a, A reservoir of CD4" T cells containing latent (non-replicating) viruses is 
established soon after infection with HIV. Deng et al.’ show that, in patients 
who have commenced combined antiretroviral therapy (CART) within three 
months of infection, most of these latent viruses have wild-type sequences at 
the regions encoding the viral structures towards which the CD8* cytotoxic 
T cells (CTLs) of the host immune system are predominantly directed 
(dominant viral epitopes; red). However, in patients starting cART later than 
this, more than 98% of the viruses have already mutated these sequences to 


recognize and kill infected cells that exhibit 
any HIV-gene expression. But even here, 
perhaps not surprisingly, the virus seems to 
retain the upper hand, by using its capacity for 
extensive genetic variation and rapid evolution 
to evade CTLs, through a process known as 
mutational escape’. 

Deng and colleagues addressed the key 
question of whether reactivated latent virus 
is recognized by an infected individual’s 
HIV-specific T cells. The answer, at least in 
individuals started on cART more than three 
months after infection (chronic-phase cART), 
was daunting: more than 98% of HIV inte- 
grated into the patients’ latently infected CD4* 
T cells contained sequence changes in regions 
(epitopes) of the viral Gag protein (the primary 
target of effective CTL responses) correspond- 
ing to previously characterized CTL-escape 
mutants (Fig. 1a). The researchers confirmed 
that these Gag mutations conferred func- 
tional immune escape by demonstrating that 
CD8* T cells from some of the patients given 
chronic-phase cART recognized the wild type, 
but not the mutant, forms of the relevant Gag 
epitopes. They also verified that the mutant 
sequences were present in replication-com- 
petent virus derived from the chronic-phase 
cART patients’ latently infected CD4* T cells, 
indicating that mutant virus could contribute 
to viral re-emergence if cART were stopped. 

By contrast, the authors observed that, in 
patients who began cART within the first three 
months after HIV infection, the latent virus 
was of predominantly wild-type sequence 
(Fig. 1a). Intriguingly, these findings suggest 
that the latent HIV reservoir in untreated 
subjects is more dynamic than was previ- 
ously thought. Of greater clinical importance 
is the implication that the conventional CTL 


b Conventional vaccine 


cART after 3 months 


responses against dominant HIV epitopes 
(which arise during the early phase of HIV 
infection and help to determine the level of 
viral replication during the chronic phase) will 
almost certainly not contribute to destroying 
reactivated latently infected cells (Fig. 1b). 
As such, these conventional CTL responses 
cannot be expected to supply the cell killing 
required for effective cure, even if the response 
is boosted by therapeutic vaccination using 
conventional vaccines — at least for the vast 
majority of HIV-infected individuals who are 
started on cART during chronic infection. 

But the news from Deng and colleagues is 
not all bad. The investigators also found that 
chronically infected patients on cART retain 
CD8* T cells that are specific for subdominant, 
non-mutated Gag epitopes, and show that 
these cells can recognize and even kill cells 
infected with mutated virus in vitro and in 
vivo (ina humanized mouse model). The ‘rub’ 
with these data is that the efficiency of this cell 
killing seems to depend on in vitro preactiva- 
tion of the CD8* T cells, and Deng et al. pro- 
vide no evidence that these broadly targeted 
CD8* T cells can be directly recruited in vivo 
to clear cells harbouring reactivating virus in 
cART-suppressed infections. 

However, the data are important for 
their clear indication that exploiting CTL 
responses in attempts to cure infection 
will require approaches that induce CD8* 
T-cell responses to subdominant epitopes 
(Fig. 1c) or to epitopes that are not natu- 
rally targeted at all during the course of 
HIV infection, as has been described for 
SIV infections in rhesus macaques’. Even 
these approaches will need to surmount yet 
more hurdles, including the presence of resi- 
dual virus in immune-privileged ‘sanctuary’ 
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‘escape the immune response (dominant-epitope escape mutants; orange). 
Viral sequences encoding structures towards which little CTL activity is 
directed (subdominant viral epitopes) are shown in blue. b, The findings 
suggest that, in patients who start cART after three months, conventional 
vaccines will be ineffective in combating viruses reactivated from this 
reservoir, because such vaccines primarily mobilize CTLs that target the wild- 
type dominant viral epitopes. c, Instead, vaccines will need to be designed that 
mobilize CTLs targeting subdominant (non-mutated) viral epitopes and that 
may thus be able to kill cells harbouring reactivated latent virus. 


sites, such as B-cell follicles”®. 

Sun Tzu also advises generals to “avoid 
what is strong, and strike at what is weak”. For 
curing HIV infections, this advice would 
translate to the identification and therapeu- 
tic targeting of HIV’s weaknesses, and sug- 
gests that understanding barriers to cure and 
defining the biology underlying these barri- 
ers are the first steps towards overcoming this 
viral enemy. HIV cure will almost certainly 
require a multimodal therapeutic approach 
incorporating both pharmacological activa- 
tion of latent reservoirs and immune-mediated 
clearance mechanisms, with each component 
designed to exploit one of this formidable 
enemy’s few weaknesses. = 
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he Nature Insight ‘Frontiers in Biology’ aims to 

cover timely and important developments across 

biology, ranging from subcellular molecular 
mechanisms to whole-organism physiology, and 
biomedicine. 

The Insight begins with a review on the amygdala, a 
brain structure that responds to both negative and positive 
associations during functionally different behaviours. 
This structure has assumed a complex position within the 
circuits underlying emotional and motivational processes. 
Embracing this complexity, Patricia Janak and Kay Tye 
reveal how advances in genetic targeting, anatomical 
tracing and neuronal activity modulation are providing 
the means to define a road map for this important 
structure's complex connectivity. 

David Artis and Hergen Spits then go on to review the 
rapidly moving field of innate lymphocyte biology. Innate 
lymphoid cells are the most recently identified constituents 
of the innate immune system. They have important roles 
in protective immunity and inflammation, integrating 
innate and adaptive immune responses and controlling 
tissue homeostasis in infection, chronic inflammation, 
metabolic disease and cancer. 

The ability to sense and to respond to nutrients such as 
glucose, amino acids and lipids is essential for life. David 
Sabatini and colleagues explore how mammals sense 
these nutrients and discuss the deregulation of sensing 
mechanisms in disease. 

Programmed cell death is essential for many 
physiological processes, including the shaping of 
developing organs, epithelial cell renewal and lymphocyte 
selection. Manolis Pasparakis and Peter Vandenabeele 
discuss the regulation, initiation and execution of different 
types of regulated cell death with a particular focus on the 
non-apoptotic cell-death process necroptosis. 

DNA methylation is an epigenetic modification that 
is generally associated with gene silencing. Recent 
technological advances have enabled the generation of 
genomic maps at unprecedented resolution that should 
help resolve its specific functions. In the final Review, 
Dirk Schtibeler summarizes our current understanding 
of the regulation and function of DNA methylation, and 
discusses its utility as a cellular marker in basic biology 
and biomedicine. 

Alex Eccleston, Noah Gray, Sadaf Shadan & 
Ursula Weiss 
Senior Editors 
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From circuits to behaviour 


in the amygdala 


Patricia H. Janak’? & Kay M. Tye? 


The amygdala has long been associated with emotion and motivation, playing an essential part in processing both fearful 
and rewarding environmental stimuli. How can a single structure be crucial for such different functions? With recent 
technological advances that allow for causal investigations of specific neural circuit elements, we can now begin to map 
the complex anatomical connections of the amygdala onto behavioural function. Understanding how the amygdala con- 
tributes to a wide array of behaviours requires the study of distinct amygdala circuits. 


differentiate us from other animals, we share emotional behav- 
iours — defined as behavioural responses to emotionally sig- 
nificant stimuli such as food or threats — with other vertebrates. The 
amygdala is a brain region that is important for emotional processing, the 
circuitry and function of which has been well-conserved across evolution 
(Fig. 1), although species differences do exist’. Even non-mammalian 
species such as reptiles, birds and fish have an amygdala-like brain region 
with similar circuits and functions to the amygdala in mammals”. 
Conservation of amygdala circuitry allows findings from one species to 
inform our appreciation of amygdala functioning in others. Understand- 
ing the intricacies of amygdala circuitry is of tremendous importance 
given that the amygdala is implicated in a wide range of disease states, 
including addiction, autism and anxiety disorders. Regarding forward 
translation, features identified using new approaches for neural circuit 
dissection in rodents have the potential to be directly relevant to humans 
in well-conserved structures such as the amygdala. For reverse translation, 
observed correlations between amygdala function and human behaviour 
can then be brought back to animal models to facilitate the elucidation 
of underlying mechanisms using systematic, iterative experimentation. 
The field is ripe for this type of translation, as reflected by the growing 
emphasis on amygdala research across species (Fig. 2). The collective body 
of work supports a view of the amygdala as a composite of parallel circuits 
that affect multiple aspects of emotional behaviour. This Review focuses 
on recent advances that have been enabled by technologies primarily used 
in rodents. Readers are directed to recent reviews for in-depth informa- 
tion on the amygdala and fear®, reward’, learning-related plasticity”, 
and anatomical or physiological characteristics”. 


A Ithough humans possess a number of cognitive abilities that 


First clues to the role of the amygdala in behaviour 

Lesion studies in non-human primates presented the earliest clues that 
the amygdala was important for emotional reactions to stimuli. In 1888, 
Brown and Schafer performed a bilateral ablation of the temporal lobe ofa 
thesus monkey (Macaca mulatta)", reporting that: “A remarkable change 
is ... manifested in the disposition of the Monkey ... He gives evidence 
of hearing, seeing, and of the possession of his senses generally, but it is 
clear that he no longer clearly understands the meaning of the sounds, 
sights, and other impressions that reach him.’ In addition, Brown and 
Schafer, and later Kliiver and Bucy“, reported reduced aggression, fear 
and defensive behaviours. Although these lesion effects were intermingled 
with features that we now attribute to other brain regions, these were the 


first reports describing the role of a brain region that is important for 
connecting stimuli with their emotional meaning. In 1956, Weiskrantz’° 
lesioned the amygdala, demonstrating an impairment in acquiring behav- 
ioural responses to shock-predictive cues, concluding that: “the effect of 
amygdalectomy ... is to make it difficult for reinforcing stimuli, whether 
positive or negative, to become established or to be recognized as such.” 

Following these early studies, amygdala lesions in both rodents'*” and 
humans’*”’ revealed a strong conservation of function across species, 
most notably an impairment in the recognition of fearful stimuli, and 
in a type of emotional learning called fear conditioning. Fear is typically 
studied in the laboratory using a form of associative learning known as 
Pavlovian conditioning, in which an initially neutral conditioned stimulus 
(CS) is paired with an aversive unconditioned stimulus (US), such as a 
footshock, leading the experimental subject to display behavioural signs 
of fear. This simple behavioural procedure provides a window into basic 
Pavlovian learning mechanisms that act to enhance survival, and is inti- 
mately tied to amygdala function. 


What is the amygdala? 

Although modest in size, the amygdala is comprised of multiple inter- 
connected nuclei nestled deep in the temporal lobe (Fig. 1). Here we 
focus on the basolateral complex of the amygdala (BLA; made up 
of the lateral (LA), basal (BA) and basomedial (BM) cell groups) and 
the central nucleus of the amygdala (CeA; made up ofa lateral (CeL) 
subdivision anda medial (CeM) subdivision). The BLA consists of glu- 
tamatergic principal neurons and inhibitory interneurons. CeA neu- 
rons are primarily GABAergic, with the CeL projecting to the CeM. An 
interconnected sheath of GABAergic neurons, termed the intercalated 
cells, is also found interposed between the BLA and CeA, providing an 
important source of inhibition”. 

A highly simplified view of information flow through the amygdala is 
as follows. The amygdala receives information about the external envi- 
ronment from the sensory thalamus and sensory cortices, which project 
strongly to the LA. The LA projects within the BLA to the BA and BM, 
as well as to the neighbouring CeA. The BLA is reciprocally connected 
with cortical regions, especially the midline and orbital prefrontal corti- 
ces (PFCs), and the hippocampus (HPC), as well as sensory association 
areas’; in primates, these reciprocal connections extend to primary sen- 
sory areas’. Hence, the BLA transmits information widely throughout 
cortical regions, but its neuronal processing is greatly affected by excita- 
tory projections from these regions. The relative enlargement of the BLA 
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compared with the CeA between rodents and primates might result from 
the substantial increase in the size of cortical regions that communicate 
with the BLA in primates”. Predominantly unidirectional outputs of the 
BLA include the striatum, especially the nucleus accumbens (NAc), and 
the bed nucleus of the stria terminalis (BNST) and the CeA. In turn, the 
striatum, BNST and CeA have been considered to mediate the transla- 
tion of BLA signals to behavioural output. Of note, there are exceptions 
to this serial model of information flow; the BA and CeA also receive 
sensory inputs, and the CeA contributes to some behavioural processes 
independently from the BLA” 


From rodents to humans and back again 

The neural circuits underlying Pavlovian fear conditioning have predomi- 
nantly been explored in rodents. Simple tone-shock pairings produced 
robust and reproducible changes in amygdala neural responses to the 
tone™”®, such that neuronal spiking tracks the acquisition and extinction 
of fear behaviour in response to the tone. Human studies using functional 
magnetic resonance imaging (fMRI) adapted these simple fear-condi- 
tioning tasks and showed that the human amygdala is also activated by 
fear-conditioned stimuli, and that this activation wanes on extinction”. 
Furthermore, ventromedial prefrontal cortex (vmPFC)-amygdala cir- 
cuitry was found to mediate fear extinction in both rats”*° and humans”. 
By applying tasks designed for rodents to humans, the circuits mediating 
fear acquisition and extinction were shown to be well conserved. 

Not only does fear conditioning activate both the rodent and human 
amygdala, but these animals also share long-term memory processes 
for fear memory. When consolidated fear memories of rodents are 
reactivated during retrieval, they become labile, thereby requiring 
‘reconsolidation’”. During this labile state, memories can be modified 
by interfering with the reconsolidation process using protein-synthesis 
inhibitors” or with the non-invasive presentation of non-fearful infor- 
mation in mice, rats and humans*”™. 

Reverse translational approaches have become more frequent with the 
advent of new circuit mapping and manipulation technologies in rodents. 
One example in which a correlation in humans was taken to causation in 
rodents comes from fMRI studies in people with generalized anxiety dis- 
order®. The observation of abnormal functional connectivity between the 
BLA and the CeM in these people inspired the application of optogenetic 
tools to probe specific projections within mouse amygdala. Optogenetics 
allows for the rapid and reversible activation or inhibition of neurons by 
directing light towards neural elements that have been artificially induced 
to express light-sensitive opsins. In the first demonstration of optogenetic 
projection-specific manipulation in a freely moving animal, the increase 
or decrease of transmission between the BLA and the CeA was shown 
to cause the reduction or augmentation of anxiety-related behaviour”, 
complementing the {MRI findings. 

Any attempt to define the behavioural functions of amygdala neu- 
ronal activity is confronted by the dense interconnections among amyg- 
dala nuclei and between amygdala nuclei and other brain regions, and 
by the lack of a predictable distribution of functional cell types. The 
availability of new neurotechnologies and approaches is overcoming 
these hurdles and accelerating the mapping of function onto amygdala 
circuitry, revealing a complex picture of amygdalar control of behaviour. 
These findings, achieved using optogenetic and pharmacogenetic acti- 
vation or inhibition, in conjunction with behaviour and electrophysiol- 
ogy, reveal causal relations between amygdala cell types and projections 
in various behaviours (Fig. 3). As we discuss, the behavioural functions 
of afferent and efferent projections had not been determined at this 
level of specificity until the application of these new technologies for 
circuit manipulation. 


New insights into circuitry for fear 

The investigation of the neural basis of fear learning and expression 
led to the view that the amygdala is a rapid detector of aversive envi- 
ronmental stimuli and situations, producing affective or behavioural 
states to allow for adaptive responses to potential threats*’”**. Along 


REVIEW 


Figure 1 | Evolution of the amygdala across species. Primary amygdalar 
nuclei and basic circuit connections and function are conserved across species. 
An enlarged image of the basolateral complex of the amygdala (BLA) and 
central nucleus of the amygdala (CeA) or analogues are shown next to a coronal 
section from the brains ofa lizard, mouse, rat, cat, monkey and human. 


the way, this line of research uncovered crucial brain mechanisms of 
associative conditioning, arguably providing us with our best under- 
stood neurobehavioural model of learning. 

The LA has been the focus of many studies because it has ready access 
to information about the auditory cue used in conditioning, and lesions 
of this region block acquisition of conditioned freezing’®'”*”. LA neu- 
rons develop and maintain excitatory neural responses to the onset of 
an auditory cue that has been paired with a footshock US”, These 
responses in vivo probably arise from potentiation of sensory inputs onto 
LA neurons because CS-US pairings enhance measures of excitatory 
synaptic plasticity in vivo” and in acute amygdala slice preparations. 
In this model, an initially weak afferent carrying sensory information 
about the CS and a strong afferent carrying US information converge 
onto individual principal neurons in the LA and, through a Hebbian 
plasticity mechanism, lead to enhanced strength of the excitatory syn- 
apses carrying CS information. This experience-dependent synaptic 
strengthening allows the presentation of the CS alone to activate LA 
neurons. The model of CS and US convergence has been explored by 
studies that took advantage of the temporal specificity of optogenetics 
to activate neural elements in a time window corresponding to the few 
seconds of CS or US presentation. The first study expressed the excitatory 
opsin, channelrhodopsin 2 (ChR2), in LA principal neurons to allow 
for rapid and reversible activation of these neurons during behaviour. 
When paired with an auditory CS, this simultaneous photoactivation of 
LA neurons could be used as a substitute for the footshock US, resulting 
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Figure 2 | Number of studies on the amygdala. Publications on the 
amygdala indexed on PubMed between 1950 and 2013 demonstrate the 
growing interest in amygdala research. 


in conditioned freezing*®. The second study showed that brief photo- 
activation of ChR2-expressing LA axonal terminals from the auditory 
thalamus (the medial geniculate nucleus, MGN) and the auditory cortex 
(AC) can substitute for a tone CS when paired with footshock, resulting 
in conditioned freezing and synaptic potentiation”. 

Lesions of the CeA block expression of conditioned fear”, leading 
to the hypothesis that projection routes from the LA to the CeA could 
allow for conditioned freezing behaviour. Notably, the LA does not pro- 
ject directly to the CeM, the proposed output nucleus that projects to 
non-amygdala regions mediating behavioural and autonomic signs of 
fear””°". The advent of cell-type- and projection-specific manipulation 
shed light on information flow from the LA to the CeA, and then within 
the CeA itself in terms of conditioned fear — information that was 
unattainable from lesions that remove the entire network. An important 
step was identifying two subpopulations of neurons within the CeL 
that have opposing functions and distinct genetic markers. These are 
neurons in the CeL that are inhibited in response to CS following fear 
conditioning and express protein kinase C (PKC)6, termed CeLo,; cells, 
and neurons that are excited by CS following fear conditioning that 
are PKCS , termed CeLoy. Although both subtypes of CeL neurons 
inhibit one another and both project to the CCM”, CeLoy neurons 
respond to a fear CS at a shorter latency than CeLo;;”, suggesting that 
conditioned fear responses occur following activation of CeLo, neurons 
that inhibit CeL,,,, neurons projecting to CeM output neurons”, thereby 
promoting freezing through disinhibition”. 

These studies did not, however, directly address the issue of transmis- 
sion between the LA and the CeA. A recent series of studies capitalized 
on the advantages of genetic targeting in mice to characterize a neuronal 
population in the CeL that receives LA input and might be involved in 
fear conditioning. Using a transgenic mouse line in which Cre recombi- 
nase is expressed in somatostatin-positive (SOM”") neurons, the ability of 
ChR2-expressing LA terminals to activate these cells could be determined. 
The LA was shown to form functional excitatory synapses on SOM" and 
PKCS cells in the CeL (CeL:SOM"), and the excitatory strength of these 
synapses greatly increased as a result of fear conditioning™. This find- 
ing is notable because the LA was considered to be the primary site for 
learning-related plasticity underlying fear conditioning. The presence of 
experience-dependent potentiation in the CeL indicates that this region 
is also important for acquisition of the CS—-US association, as suggested 
by earlier studies”, rather than merely serving as a relay of informa- 
tion from the LA. 

The importance of this experience-dependent plasticity onto 
CeL:SOM* neurons in fear acquisition was demonstrated using a phar- 
macogenetic approach to reversibly silence only the CeL:SOM* neurons 
during the acquisition phase of fear conditioning. Using a cre-dependent 
virus to express the inhibitory DREADD receptor hM4Di, an engineered 
G-protein-coupled receptor activated by its exogenous ligand clozapine 
N-oxide, the SOM’ cells could be reversibly and selectively inhibited. This 
inhibition prevented fear acquisition and excitatory synaptic enhance- 
ment™. Using the retrograde tracer cholera toxin B injected into SOM* 
terminal regions to allow whole cell recording from projection-defined 
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cells, it was revealed that SOM‘ cells with learning-induced synaptic 
enhancement project directly to the periaqueductal grey (PAG) as well 
as the paraventricular nucleus of the thalamus (PVT), bypassing the 
CeM. Thus, CeL projections to the CeM might not be the only CeL projec- 
tions that contribute to freezing behaviour after fear conditioning. This 
evidence revises the view of the CeM as the primary output station of the 
amygdala and indicates that information at various stages of processing 
in the CeA is sent to the PAG. 

Although pieces of information are continually emerging, these 
circuit analyses reveal a rich interaction between the LA and CeL — 
and among neurons within the CeL — in both acquiring and express- 
ing conditioned freezing in response to a fear cue, as well as multiple 
neural projections from the CeL that may regulate this behavioural 
response. Fear conditioning to the environment in which footshock is 
experienced is also mediated by the amygdala. Circuit approaches to 
contextual fear conditioning are underway; a recent study found that 
optogenetic inhibition of the BLA terminals in the entorhinal cortex 
(EC) impairs the acquisition of contextual fear conditioning”, whereas 
photoexcitation of oxytocin (OT)-releasing fibres from the hypothala- 
mus to the CeL suppresses expression”. 


Beyond fear in the amygdala 

The robustness of fear conditioning provided a highly replicable behav- 
ioural paradigm that allowed researchers to delve deep into the circuit 
mechanisms underlying this simple behaviour. However, the view that the 
amygdala is specialized only for fear conditioning is too narrow. Strong 
evidence supports an integral role for this region in other aversive states, 
such as anxiety, as well as in reward, providing a challenge for determin- 
ing how neuronal activity within the amygdala could contribute to each 
of these distinct processes. 

Cell-type- and projection-specific manipulations are providing a 
way forward. An example comes from the reverse translational study of 
anxiety described earlier. To probe the role of BLA—CeA projections in 
anxiety-related behaviour, the excitatory opsin, ChR2, was expressed in 
BLA neurons. Consistent with previous work, BLA-cell-body activation 
decreased time spent by mice in the open arms ofan elevated plus maze 
(EPM), indicating an increase in anxiety-like behaviour’. However, on 
selective excitation of the BLA-CeL pathway an anxiolytic behavioural 
phenotype of exploration of the EPM open arms was observed. Thus, 
a glutamatergic BLA projection to the CeA can promote anxiolysis”’. 

In this study, the identity (that is, ON or OFF cell, PKCS* or PKCS ) of 
the CeL neurons activated by BLA terminal optical stimulation was not 
determined. New evidence suggests that these neurons are CeL:PKC8"; 
when ChR2 was selectively expressed in CeL:PKCS5* neurons to allow 
their direct photoactivation, increased time in the open arms of the 
EPM was observed”. These data support the notion that a population 
of BLA neurons preferentially excites CeL:PKC5* neurons that lead to 
reduced anxiety, perhaps through the projection from CeL:PKCS* to 
CeM output neurons. Alternatively, the CeL:PKCS* neuronal projection 
to the BNST, also implicated in anxiety” °-60 could mediate anxiolysis. 
These studies show that the BLA-CeA pathway does not universally 
promote aversive states. Of note, there are other pathways that originate 
in the BLA that have since been implicated in bidirectional regulation of 
anxiety. Optically driving BLA projections to the ventral HPC (vHPC) is 
anxiogenic”, whereas photostimulation of BLA inputs in the anterodor- 
sal BNST (adBNST) is anxiolytic”. Photostimulation of the BLA-vHPC 
pathway also decreases social interaction, an effect perhaps related to 
the increase in anxiety that is seen following BLA-vHPC activation™. 

New findings continue to enlarge our conceptions of CeA circuitry 
in non-fear behaviour. In a recent study on feeding suppression, genetic 
targeting of CeL:PKCS" neurons was used to selectively express hM4Di 
to allow for reversible neuronal inhibition. In mice, reducing the activity 
of CeL:PKCS" neurons was found to block decreased feeding stimulated 
by sickness or unpalatable tastes™*. Conversely, optogenetic excitation of 
CeL:PKC6* neurons dramatically decreased feeding in both hungry and 
sated mice, and this was not a confound of reduced activity or increased 
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anxiety. These results indicate that CeL:PKC6* neurons suppress feeding 
in response to anorexigenic stimuli™. 

What inputs might normally serve to drive CeL:PKCS* neurons and 
suppress feeding? Tract tracing methods using cre-dependent retro- 
grade and anterograde viral techniques can reveal the specific inputs and 
outputs of a given cell type. Restriction of rabies virus to CeL:PKCS* 
neurons to allow labelling of presynaptic inputs, combined with c-fos 
immunohistochemistry to measure cellular activation, revealed multiple 
inputs to CeL:PKC6* neurons that are activated by anorexigenic signals, 
including the BLA, the parabrachial nucleus (PBN) and the insula (IN). 
These findings collectively demonstrate a unique pathway for feeding 
suppression that relies on CeL circuitry and that was not readily identi- 
fied using pharmacology or lesion experimental approaches. 

The identification ofa role for amygdala circuitry in anxiolysis and feed- 
ing suppression makes one look at this circuitry differently in comparison 
to when only considering fear. How could these processes co-exist? Excit- 
atory responses in the BLA toa fear cue activate CeLoy:PKCS cells, some 
of which are SOM’, to initiate freezing. By contrast, excitatory responses 
in the BLA also activate CeLo;,:PKCS* cells to promote anxiolysis; to 
account for these findings, separate functional groups of BLA neurons 
projecting to CeL are proposed (Fig. 4). One route for the production of 
freezing and other measures of fear is through inhibition of CeL:PKCS* 
neurons that disinhibit CeM neurons; additional routes through CeL 
projections to other targets might also mediate conditioned freezing. 
Thus, the postsynaptic targets that mediate anxyiolysis after excitation of 
CeL:PKCS* neurons may or may not be CeM neurons, and if they are, it 
is possible they are different CeM neurons to those mediating freezing. 

In summary, recent studies have demonstrated that the BLA-CeA 
circuitry is involved in a diverse array of behaviours in addition to those 
related to fear. The key to understanding the production of these differ- 
ent behaviours is in the functional anatomy. 


Adding in reward 

The above findings indicate that there must be a diversity of neuronal 
responses in the BLA, as opposed to only responses to fear cues; in fact, 
this was well known from in vivo electrophysiological recordings and is 
necessary to explain the effects of amygdala lesions on other behaviours 
besides fear conditioning. In parallel to early studies on fear conditioning, 
amygdala lesions were also found to impair reward-based behaviour® ”. 
For example, LA lesions prevent amphetamine place preference condi- 
tioning”, a procedure in which subjects learn to associate a particular 
spatial location with the reinforcing effects of the drug, and CeA lesions 
prevent conditioned orienting responses to reward-predictive cues™. 
Hence, the same lesions can impair Pavlovian conditioned behaviour in 
response to cues signalling either a rewarding or an aversive outcome. 

Notably, amygdala lesions do not impair all behaviours emitted in 
response to reward-predictive cues, rather, they impair the ability to 
respond to cues in the face of changing reward value, leading to the 
hypothesis that learning mediated by the amygdala is related to the cur- 
rent, relative value of biologically significant outcomes*’*™*, This view is 
congruent with the broader notion that the amygdala provides informa- 
tion about the ‘state value’ of an organism, defined as the value of the 
overall situation of an organism at a given moment’. 

The BLA and CeA seem to make distinct contributions to the repre- 
sentation of value, as revealed through procedures designed to change 
the current value of an outcome. BLA lesions impair the ability of value 
changes in specific reward outcomes to affect behaviour. Thus, the BLA 
is proposed to represent outcome value along with specific sensory fea- 
tures, allowing for discrimination among multiple outcomes ofa similar 
valence””””’, By contrast, the CeA is considered to maintain a more gen- 
eral representation of the motivational significance of an outcome™”. 

Just as BLA neurons develop excitatory responses to a CS paired with 
aversive outcomes, they show excitatory responses to auditory, visual 
or olfactory CSs paired with rewarding outcomes, typically sweet liq- 
uid or food pellets’. As initially proposed for fear conditioning, evi- 
dence suggests that reward cue responses develop through long-lasting 
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Figure 3 | Amygdalar circuits that are sufficient to alter behaviour ina 
diversity of domains. Projection-specific effects as shown by optogenetic or 
pharmacogenetic manipulation. The solid or dotted lines indicate the promotion 
or inhibition of certain behaviours. The basolateral complex of the amygdala 
(BLA) encompasses the lateral and basal nuclei. Specific cell types are shown in 
pink. For simplicity, projections that are anatomically or electrophysiologically 
defined but have not been shown to have a causal relationship with behaviour 
are omitted. This is a selective picture of projections that have been directly 
manipulated, and is not meant to signify their importance over other anatomical 
connections. The actual connectivity of the amygdala with other brain regions 

is considerably more complex. AC, auditory cortex; adBNST, anterodorsal bed 
nucleus of the stria terminalis; CeA, central nucleus of the amygdala; CeL, lateral 
CeA; CeM, medial CeA; D1R, dopamine 1 receptor; EC, entorhinal cortex; Hyp, 
hypothalamus; IL, infralimbic; MGN, medial geniculate nucleus; mPFC, medial 
prefrontal cortex; NAc, nucleus accumbens; OT, oxytocin; ovBNST, oval nucleus 
of the BNST; PBN, parabrachial nucleus; PKC, protein kinase C; PL, prelimbic; 
SOM, somatostatin; vVHPC, ventral hippocampus. 


enhancement of glutamatergic inputs from sensory thalamus onto princi- 
pal neurons in the LA”. Of note, the acquisition of conditioned respond- 
ing to both fear and reward cues requires an NMDA-receptor-dependent 
increase in AMPA-receptor function in LA neurons“**”*. The end result 
of this synaptic potentiation is that the cue is able to drive spiking in LA 
neurons, which in turn activates neuronal populations in downstream 
regions that contribute to the cue-triggered behaviour. Clearly, learning 
about environmental stimuli that predict the occurrence of food and other 
rewards is adaptive, as is learning about potential aversive outcomes. Thus, 
learning across a valence continuum of positive (rewarding) to negative 
(punishing or aversive) outcomes engages the amygdala. 


Amygdala neurons encode valence 

Because both fear and reward cues recruit BLA neurons, these findings 
raise the question of how processing of fear and reward cues by amyg- 
dala networks is organized. For example, would neurons with excita- 
tory responses to fear-predictive cues also show excitatory responses to 
reward-predictive cues, or is there segregation of positive- and negative- 
valenced signals? This question was addressed by directly comparing 
neural responses in the same subjects following training on both appeti- 
tive and aversive tasks. These within-subject comparisons in rodents and 
non-human primates consistently reveal populations of valence-selective 
neurons” * such that some neurons excited bya fear cue do not respond 
to areward cue, or show inhibition in the presence of the reward cue, and 
vice versa. By training subjects on two parallel cue outcome associations, 
with one cue followed by a rewarding outcome, and the other followed 
by an aversive outcome, and then reversing the cues assigned to each 
outcome, it was revealed that a substantial proportion of cue-selective 
neurons encode the outcome with which it is currently paired, not the 
sensory features of the cue itself*’*’. Outcome-specific neuronal popula- 
tions are consistent with valence-sensitive neuronal populations that are 
responsive to one valence, but not another. The finding that different BLA 
populations respond to a fear cue after learning than after extinction, 
when that cue signals no footshock*, can also be interpreted as valence 
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encoding. The observance of valence-specific neuronal activity is congru- 
ent with the notion that the amygdala is concerned with the relative value 
of the outcome, the occurrence of which is signalled by the cue. 
Notably, valence-encoding is complemented by salience-encoding; 
some BLA units show excitatory responses to both fear and reward 
80.8585" and these responses are correlated with measures of auto- 


cues 
nomic nervous system activation™. The salience of a stimulus is defined 
as the intensity of a stimulus and is a second dimension along which 
stimuli are encoded, in line with common models of emotion’. The 
salience responses may contribute to processes of arousal and attention 
that enhance processing within the amygdala or in target regions. This 
may be reflected behaviourally in better performance in real-time and in 
enhanced learning. A role in signalling the salience of stimuli is in agree- 
ment with the suggested contributions of the BLA in attention” and in 
enhancing memory storage in downstream regions”. The amygdala 
projects to basal forebrain cholinergic systems and midbrain dopamin- 
ergic systems, two means by which these effects may be mediated. For 
example, CeA projections to midbrain dopaminergic regions are required 
for conditioned orienting”. Furthermore, projections to sensory cortical 
regions, including primary visual cortex in primates”, may allow amyg- 
dala attentional signals to modulate stimulus processing™. In addition, 
findings suggest that attention modulates amygdala valence signals”. 
What might lend a cell its functional phenotype? No obvious ana- 
tomical segregation of neurons that are sensitive to the reward or fear 
cue has been detected with electrophysiology*”*, demonstrating that 
neurons encoding two very different signals are intermingled within 
this structure. But wiring must still be fundamental to cell phenotype. A 
given BLA principal neuron, for example, is likely to be associated with 
reward or fear by virtue of its distinct connectivity, including both the 
pattern of specific inputs to that neuron and its projections to effector 
regions for expression of the adaptively appropriate behaviour. Amyg- 
dala neuronal pairs sensitive to stimuli of the same valence are more 
likely to show correlated neural activity than neuronal pairs sensitive to 
opposite valences, supporting the idea of functional networks”. 


Membership in a memory network 
How do these functional networks for fear, reward or anxiety arise? 
Molecular genetic studies in mice have investigated the size, stability and 
required initial conditions for the formation of memory ensembles after 
fear conditioning. Using expression of the activity-dependent gene Arc 
to visualize activated LA neurons, the proportion of principal neurons 
participating in the fear memory trace is estimated to be 15-25%”. 
Although there may be a larger deterministic population, final member- 
ship in the trace can be biased by enhancing the activity level of individual 
LA neurons by multiple methods, including CREB overexpression” and 
brief optical activation immediately before tone—footshock pairing, which 
takes advantage of the temporal precision of optogenetics to prove that 
activity enhancement need only be present immediately before train- 
ing”. Selective ablation after training of CREB-overexpressing neurons 
abolished conditioned freezing, providing strong evidence that these 
neurons are recruited into the memory trace”. Expressing the excita- 
tory DREADD hM3Dgq ina sparse population of LA neurons and acti- 
vating these receptors during fear conditioning enhanced fear memory 
and biased the inclusion of these neurons into the fear trace. When the 
hM3Dgq receptors are activated without presentation of the CS, the condi- 
tioned freezing response is partially recapitulated”. Importantly, similar 
mechanisms act for reward learning; the acquisition and maintenance of 
a cocaine-conditioned place preference depends on the recruitment ofa 
small population of neurons in the LA’. These studies provide evidence 
that subsets of LA neurons stably participate in fear and reward memory 
traces, and there is similar evidence for ensembles in the BA”. 
Although there do seem to be similarities in the cellular mechanisms 
that promote inclusion into memory networks, does that mean that any 
LA neuron could be either a ‘fear’ or ‘reward’ neuron, or are there con- 
straints on ensemble membership? A recent study” provides evidence 
that valence-specific ensembles in the BLA (including the LA), once 
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formed, are immutable, which does not directly demonstrate that there 
are anatomical constraints on initial ensemble formation, but is congruent 
with that possibility. This study used ‘memory trace tagging’ approaches 
to determine whether a given neuron is limited to representing one 
valence or shows equipotentiality. ChR2 expression was limited to BLA or 
HPC neurons in the dentate gyrus (DG) that expressed c-fos on either fear 
(footshock) or reward (mate exposure) conditioning, thereby labelling a 
network of neurons that presumably constitute part of the memory trace. 
Later, optical activation of these networks supported either approach or 
avoidance following fear or reward conditioning, respectively, in a real- 
time place preference (RTPP) task, establishing the ‘valence of the labelled 
network. To examine the reversibility of valence assignment, a network of 
a given valence was activated during subsequent retraining with outcomes 
of the opposite valence. Whereas pairing an ensemble of DG neurons 
activated with one valence with the opposite valence US could producea 
‘switch in the valence of the tagged ensemble, this was not true for BLA 
ensembles’. It seems likely that this ‘fixed valence’ feature derives from 
the distinct wiring of BLA neurons into positive- and negative-valenced 
networks, and is in agreement with the observations made within the 
awake recording studies already discussed. Together, these findings raise 
the question of what the cellular identity and anatomical connectivity of 
those positive or negative valence ensembles might be. 

Although reward and fear networks in the BLA seem to occupy an 
overlapping spatial location, a recent study using c-fos activation patterns 
paints a different picture for the CeA. By taking advantage of the different 
time courses for stimulation of c-fos mRNA and protein production, the 
CeA neuronal populations activated by two oppositely valenced stimuli 
were visualized using fluorescence immunohistochemistry and in situ 
hybridization within mice. Neurons activated by morphine were located 
primarily in the CeL, whereas neurons activated by footshock were mainly 
found within the CeM™. Although this study provides intriguing new 
information on the activation of CeA populations in a valence-specific 
manner, earlier studies already reviewed indicate that neurons within 
both the CeL and CeM contribute to diverse behaviours, so the functional 
implications of this result remain to be determined. 


Valence encoding in BLA inhibitory networks 

An important key to understanding neural diversity is determining how 
individual neurons act within a circuit, and electrophysiology provides 
one means to query the role of a neuron. However, a drawback of in vivo 
extracellular recording studies is that the neuronal subtype, such as 
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projection neuron or interneuron, can only be inferred from physiologi- 
cal measures. The ability to target opsin expression in defined neuronal 
subpopulations, in combination with electrophysiology, is overcoming 
this issue and has allowed us to gain new understanding of the inhibi- 
tory processes in the amygdala. 

As described above, interactions among CeL inhibitory neurons, 
including inhibitory neurons that send projections outside the CeL 
itself, are crucial for the production of behavioural measures of fear. 
By contrast, in the BLA, inhibition entails suppression of excitatory 
principal projection neurons by local inhibitory interneurons and inter- 
calated neurons. A recent study harnessed the power of optogenetics to 
study inhibitory BLA interneurons by temporally restricting excitation 
or inhibition of parvalbumin (PV) or SOM interneurons to the US foot- 
shock or the auditory CS, and observing the effect on fear condition- 
ing’. Intriguing evidence of two different local circuit mechanisms that 
disinhibit principal neurons was found: during US presentation, PV 
neurons that provide perisomatic innervation of BLA principal neurons 
are inhibited by footshock, thereby directly disinhibiting BLA principal 
neurons; during the CS, PV neurons inhibit SOM interneurons that 
selectively contact principal neuron dendrites to disinhibit input-driven 
activity (Fig. 5a). Unit recordings were made during behaviour, and 
comparisons between these naturally occurring and optically evoked 
waveforms were used to convincingly ascribe neural correlates of behav- 
iour to spike activity of PV and SOM interneurons. The electrophysi- 
ological activity of a subset of these interneurons during the CS and 
the US was the same as predicted by the model constructed from the 
optogenetic manipulations, providing support for the proposed local 
circuit mechanisms. Thus, biologically significant stimuli and the cues 
that predict them change the firing of inhibitory interneurons, the activ- 
ity of which gates the responsiveness of principal neurons. 

Interestingly, the in vivo activity of PV and SOM interneurons is not 
limited to the role suggested by the above model. Of note, in vivo record- 
ings found that a substantial subset of interneurons responded to the CS 
and US in an opposite way to that predicted by the model™, yet another 
example of neuronal response diversity. This is consistent with a view of 
parallel opposing circuits in the BLA, in which distinct populations of 
inhibitory neurons may govern the activity of BLA neurons mediating 
positive and negative valenced associations. 

Together, the findings support proposed roles for inhibitory neurons 
in shaping functional networks in the amygdala®””. Neuronal networks 
in the BLA may actively suppress other networks, the output of which 
would be incompatible with current behavioural requirements by feed 
forward inhibition onto a principal neuron through an intervening 
interneuron activated by a neighbouring principal neuron encoding 
a different valence or behaviour. To flexibly switch between different 
behavioural states, mutual inhibition among parallel competing net- 
works is an attractive solution, although there may be other means of 
interaction among competing networks (Fig. 5). It is probable that long- 
range excitatory synapses, for example from the cortex, onto inhibitory 
interneurons (or GABAergic intercalated populations”) are crucial for 
the suppression of opposing networks” (for simplicity, external afferent 
influences are not depicted in Fig. 5b). Other means to inhibit opposing 
networks include changes in synaptic input; extinction, for example, 
induces heterosynaptic inhibition of thalamic inputs onto LA neu- 
rons’. In addition, the effect of interneurons on principal neurons is 
dynamic. New evidence indicates that inhibitory synapses onto prin- 
cipal cells change with experience; extinction training increases PV™ 
perisomatic synaptic contacts on principal neurons that are activated 
during fear learning’”*, an example of how inhibition can remodel a 
network as the value of a cue changes. 


Valence, amygdala circuits and behaviour 

Given its access to primary sensory information, the amygdala seems 
well suited to rapidly process and transmit information regarding the 
positive or negative valence of stimuli to bias behaviour in an adaptive 
manner. The evidence reviewed above suggests that networks within 
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the amygdala are organized into distinct neural circuits for positive- 
and negative-valenced stimuli. How do these circuits come to affect 
behaviour? 

To answer this question, we must map functionality onto the ana- 
tomical connections within the amygdala itself and onto projections 
from the amygdala. For fear conditioning, functional circuit mapping 
is relatively well developed, as described above. In the case of reward- 
related behaviours, locomotion, directed approach and manipulation 
of objects in the environment in order to obtain or interact with the 
reward are usually required. These types of coordinated, but flexible 
action patterns, engage corticostriatal circuitry. The BLA projects 
robustly to the NAc'’’, which mediates motivated responding to 
reward-predictive cues for both natural and drug rewards; hence the 
BLA-NAc projection could allow information regarding the current 
value of cues to affect reward-related behaviour, as suggested by earlier 
findings®*”"*. These ideas can be directly assessed using projection- 
specific manipulations during behaviour. The first findings along these 
lines reported that the BLA-NAc projection is in itself sufficient to 
support positive reinforcement, as demonstrated by intracranial self- 
stimulation’®*'”’ and RTPP, in which entry into a particular spatial 
location triggered stimulation of BLA-NAc terminals’”. More recently, 
optical inhibition of the BLA—NAc projection (as well as the BLA-pre- 
limbic (PL) projection) was found to prevent reinstatement of respond- 
ing in reaction to a cocaine-paired cue in an animal model of relapse, 
further supporting a role for this projection in cue-reward associa- 
tions''’. These findings demonstrate that some BLA neurons involved 
in reward project to the NAc. Perhaps a subset of these receive efferents 
from the PFC; photoactivation of mPFC terminals in the BLA has been 
reported to increase instrumental responses for food’. Collectively 
these studies have begun to define amygdala circuits that contribute 
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to reward-related behaviour, although the circuit analysis is not as far 
along for reward as it is for fear. 

As function continues to be mapped onto amygdala circuitry, our 
understanding of how neural signals in the amygdala affect behaviour 
will continue to grow. A wide range of behavioural changes has been 
achieved by optogenetic or pharmacogenetic manipulation of the 
projections studied so far (Fig. 3). Two observations can be made: 
first, multiple projections leading from the amgydala affect a single 
behaviour, and second, behaviours of different or opposite valence 
are affected by projections between the same two brain regions. Both 
conditioned freezing and anxiety exemplify the first observation. In 
the case of anxiety, for example, the finding that three different BLA 
efferents influence anxiety-like behaviour’®*'” suggests that even at 
the level of the comparatively simple system of the amygdala, neural 
control of a single behaviour is not reduced to one serial pathway 
but is normally accomplished by multiple circuits, although these 
circuits potentially interact. The second observation that the same 
pathway may affect very different behaviours is exemplified by the 
role of the BLA-PL pathway in both fear and relapse to reward seek- 
ing. In each of these behavioural procedures’, photoinhibition of 
the pathway impaired the behavioural effects of the conditioned cue; 
this pathway may be valence-independent, or may carry mixed fibres. 
These possibilities can be distinguished using the types of approaches 
discussed here. 

This new functional map supports a view of the amygdala as a com- 
posite of parallel circuits that contribute to multiple behavioural states. 
Although there may be a substantial degree of overlap among the 
circuit elements, the circuits are differentiated by the specific details 
of their neural connections, both afferent and efferent, and the pat- 
terns of activation of those connections. Hence, to fully understand 
the relationship between neurons that transmit information about the 
valence of a stimulus and subsequent behaviour, neural circuit analysis 
is key. This way of thinking about the amygdala is different from past 
conceptions of it as a fear hub or as a circuit providing a readout of 
positive or negative affect in simple terms. Instead, the emphasis is on 
understanding the behaviourally relevant functions of paths of infor- 
mation flow through these regions, including how diverse, primarily 
sensory, inputs might interact locally to produce varied downstream 
functional effects. 

Although we make the case that the amygdala contributes to a 
diverse set of behaviours, as revealed by the fine-grained analysis of 
circuits, the notion of valence remains a useful heuristic. Sensitivity 
to the valence of a stimulus, of whether it is good or bad, is crucially 
important to ensure appropriate behavioural responses that promote 
approach and the acquisition of food, safety and social partners, and 
in alternative circumstances, that promote vigilance, avoidance or 
aggression towards threats. Each of the behaviours affected by amyg- 
dala manipulations falls somewhere along the continuum of ‘good’ 
to ‘bad’. We note that organizing adaptive behaviours only along the 
single dimension of valence is certainly an oversimplification, but 
this may capture an essential feature of these diverse behaviours that 
engages the amygdala. 


Looking forward 
Recently, we have made great strides in delineating the functional 
microcircuitry of the amygdala using the new technologies of optoge- 
netics, pharmacogentics and viral-based tract tracing that take advan- 
tage of gene-targeting. Crucially, these approaches have been used in 
partnership with state-of-the-art electrophysiology and careful behav- 
ioural analysis. This programme of circuit analysis is equally applica- 
ble to the study of other neural systems and is a way forward towards 
deeper knowledge of the functions that emerge from neural circuits. 
As we look to the future of research on amygdala circuits, we con- 
sider areas that deserve attention. It is important to define amygdala 
functional microcircuitry in preclinical models of human behavy- 
ioural disorders. Animal and human studies implicate the amygdala 
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in anxiety disorders, autism and addiction. In the case of addiction, 
chronic alcohol use alters neural transmission in the CeA, and these 
changes have been linked to excessive alcohol use'*, Furthermore, 
alcohol and drug seeking triggered by auditory cues requires the 
BLA'!*"!®, whereas memory traces related to the smell and taste 
of alcohol that drive relapse are stored in the CeA’”. In the case of 
autism, understanding the mediation of social interaction by the 
amygdala is highly relevant**'"*, adding momentum to this line of 
research®. 

Of great interest is determining how information from the amyg- 
dala affects downstream cortical circuits. Much research indicates 
that neural connectivity with the orbital prefrontal cortex is impor- 
tant for updating cue values after changes in their associated out- 
come®*!!*”° Tn addition to the orbital prefrontal cortex, BLA neurons 
project strongly to the medial PFC. The function of BLA signals in 
these cortical regions is less likely to be closely tied to discrete behav- 
ioural output. Instead, these BLA-cortical projections are proposed to 
mediate the impact of Pavlovian associations on decision making”. 
In addition, electrophysiological recordings have uncovered dynamic 
experience-dependent interactions between the amygdala, PFC and 
HPC in the entrainment of oscillations of different frequencies”, 
but their causal role in behaviour is not clear. Selective projection 
manipulations may reveal the circuit and behaviour impacts of these 
long-range interactions. New studies manipulating specific amygdala 
projections to the cortex are beginning to address these issues'!!""!* 

Increasingly, the model systems available for circuit analysis will 
include primates as genetically accessible non-human primate models 
are developed and viral-mediated tools for neural manipulation are 
optimized, allowing for further investigation of species similarities 
and differences in amygdala function. Moving forward, it is crucial 
to complement causal manipulations with measurements of neural 
activity during a wide variety of behaviours across multiple species. 

Although we champion the ability to manipulate circuit compo- 
nents that can be isolated with genetic or anatomical features, the 
existing tools still have limitations that prevent a comprehensive 
understanding of these circuits. Genetically encodable tools for neu- 
ral manipulation allow far greater targeting specificity than before, 
but it is unlikely that all of the neurons that project from one region 
to another, or that share a genetic marker, have identical functions. 
Thus, we may still be observing a ‘majority vote’ for a given behav- 
ioural readout when manipulating any circuit component, and only 
more specific targeting strategies will reveal the functional minority 
populations. Along these lines, the synchrony and timing of most 
photostimulation experiments are not physiological and could dis- 
turb important rhythmic interactions across distal networks in ways 
we do not understand. To solve these issues, we need a greater library 
of molecular markers and tools to target combinatorial expression 
patterns, and we need the means to induce more naturalistic activity 
patterns in neurons. Basic science insights into molecules, synapses, 
cells and circuits will need to be synthesized to achieve this level of 
understanding. = 
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The biology of innate 


lymphoid cells 


David Artis' & Hergen Spits? 


The innate immune system is composed of a diverse array of evolutionarily ancient haematopoietic cell types, includ- 
ing dendritic cells, monocytes, macrophages and granulocytes. These cell populations collaborate with each other, with 
the adaptive immune system and with non-haematopoietic cells to promote immunity, inflammation and tissue repair. 
Innate lymphoid cells are the most recently identified constituents of the innate immune system and have been the focus 
of intense investigation over the past five years. We summarize the studies that formally identified innate lymphoid cells 
and highlight their emerging roles in controlling tissue homeostasis in the context of infection, chronic inflammation, 


metabolic disease and cancer. 


taneously facilitate peaceful cohabitation with the trillions of 

beneficial microorganisms that constitute the microbiota, to 
provide host defence against infectious agents, and to initiate the 
repair and remodelling processes that restore and maintain tissue 
homeostasis’”. Groundbreaking studies over the past five years for- 
mally identified innate lymphoid cells (ILCs) as part of the innate 
immune system; these cells can directly communicate with a wide 
variety of haematopoietic and non-haematopoietic cells to orches- 
trate immunity, inflammation and homeostasis in multiple tissues 
throughout the body*>. 

ILCs are a distinct arm of the innate immune system that are 
regulated by multiple endogenous mammalian cell-derived factors 
including neuropeptides, hormones, eicosanoids, cytokines and other 
alarmins* >. The specialized distribution of ILCs in lymphoid and 
non-lymphoid tissues across multiple species, coupled with their 
functional heterogeneity’, has provoked a fundamental reassess- 
ment of how ILCs integrate innate and adaptive immune responses 
and control diverse physiological processes. In this Review, we sum- 
marize the recent findings of multiple groups that converged on the 
identification of ILCs and discuss our current understanding of the 
developmental and functional heterogeneity of this cell population. 
Furthermore, we highlight the emerging roles of ILCs in control- 
ling tissue homeostasis in the context of infectious diseases, chronic 
inflammation, metabolic homeostasis and cancer. 


r | Vhe innate and adaptive immune systems have evolved to simul- 


Definition of ILC subsets 

The ILC family encompasses not only classic cytotoxic natural killer 
(NK) cells (which were discovered’ in 1975 and are involved in pro- 
tection against certain viruses and tumours) and lymphoid tissue 
inducer (LTi) cells (which were discovered’ in 1997 and promote the 
formation of lymph nodes during embryogenesis), but also more 
recently described non-cytotoxic ILC populations. All members of 
the ILC family are characterized by a classic lymphoid cell morphol- 
ogy, but lack the expression of cell-surface molecules that identify 
other immune cell types, and so are defined as cell lineage marker- 
negative (Lin) cells**”""'. ILCs express subunits of cytokine receptors 
including interleukin (IL)-2 receptor-a (also called CD25) and IL-7 
receptor-a (also known as CD 127), but unlike adaptive T and B lym- 
phocytes, ILCs lack expression of somatically rearranged antigen 


receptors and so do not exhibit any degree of antigen specificity”. 
Recent analysis of the developmental pathways of the ILC family 
members indicates that NK cells and non-cytotoxic helper ILCs are 
separate lineages. The non-cytotoxic ILCs consist of three distinct 
groups: group 1 ILCs (ILC1s), group 2 ILCs (ILC2s) and group 3 
ILCs (ILC3s) including LTi cells*’*"“ (Fig. 1). As described later, the 
non-cytotoxic ILC subsets are defined on the basis of their differential 
requirements for transcription factors during development, their pat- 
terns of expression of effector cytokines and the acquisition of other 
distinct effector functions™. 

ILC1s are largely non-cytotoxic Lin’ cells that are capable of pro- 
ducing interferon-y (IFNy) and tumour necrosis factor (TNF) and 
that have been implicated in immunity to intracellular bacteria and 
parasites’*'*”. By contrast, ILC2s produce T helper 2 (T,,2)-cell- 
associated cytokines (including IL-4, IL-5, IL-9 and IL-13) and/ 
or the epidermal growth factor receptor ligand amphiregulin and 
promote type 2 inflammation required for anti-helminth immunity, 
allergic inflammation and tissue repair® ’°’*”°. Finally, depending 
on the stimulus, ILC3s produce IL-17A, IL-17F, IL-22, granulocyte 
macrophage (GM) colony-stimulating factor (CSF) and TNF and 
can promote antibacterial immunity, chronic inflammation or tissue 
repair’ *°, ILC3s are heterogeneous in both mice and humans. In 
mice, two subsets can be distinguished on the basis of their expression 
of the chemokine receptor CCR6. CCR6* ILC3s encompass CD4° and 
CD4 LTicells. The CCR6 ILC3 population consists of two subpopu- 
lations that can be distinguished based on expression patterns of the 
natural cytotoxicity receptor (NCR) NKp46. In humans, almost all 
ILC3s express CCR6 and CD117 (also known as c-kit), and at least 
two subsets can be distinguished on the basis of the expression of the 
NCR Nkp44 (refs 21-23). Strikingly, the ILC1, ILC2 and ILC3 sub- 
sets exhibit remarkable functional similarity with T-helper-cell sub- 
sets in terms of cytokine expression and potential effector function, 
although these cells perform their diverse functions in the absence 
of antigen specificity. 


Developmental requirements for ILCs 

It is well established that all lymphocytes arise from a common lym- 
phoid progenitor (CLP) that differentiates into precursors that are 
committed to particular cell lineages'*”°”’. Whereas precursor cell pop- 
ulations committed to the T- and B-cell lineages have been extensively 
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Figure 1 | The innate lymphoid cell family. Group 1, group 2 and group 3 
innate lymphoid cells (ILCs) are defined by differential expression of cell- 
surface markers, transcription factors and patterns of expression of effector 
cytokines. ILCs can be activated by a diverse range of stimuli including 
neuropeptides, hormones, eicosanoids, cytokines and other alarmins, and can 
contribute to immunity, inflammation and maintenance of tissue homeostasis. 


characterized, CLP-derived committed ILC precursors have only 
recently been identified. Precursors that can develop into all ILC 
subsets and NK cells (but not into T cells and B cells) are contained 
within a population that seems to be very similar to that of CLPs but 
that express the integrin a467, referred to as a-lymphoid precursor 
(aLP) cells. aLP cells expressing the chemokine receptor CXCR6 are 
heterogeneous and can develop into conventional NK (cNK) cells and 
ILC3s, but not T cells or B cells and may include the common ILC 
and cNK cell precursors’*”*”’. Downstream of these cells are at least 
two precursors that express the transcriptional repressor Id2, which 
can develop into ILCs and NK cells (see “Transcriptional regulation 
of development and function’)'*"*”*. One of these Id2* ILC precursor 
populations expressed CD 127 and the integrin «487, and could be 
distinguished from CLPs by the absence of FLT3 and CD93 (ref. 13). 
The Id2* ILC precursors could develop into ILC1s, ILC2s and ILC3s, 
including LTi cells, but were unable to develop into cNK cells!®. 
Another ILC precursor was identified on the basis of expression 
of the transcription factor promyeloid leukaemia zinc finger (PLZF; 
encoded by Zbtb16), which is important for the maturation of NK T 
cells***’, PLZF*CD3¢° cells were found in fetal liver and adult bone 
marrow, expressed CD127, «487, Thyl and CD117, and could give 
rise to CD127* ILC1s, ILC2s and ILC3s in vitro and in vivo but were 
unable to develop into LTi cells or cNK cells". These data suggest that 
Id2*PLZF  LTi cell or ILC precursors are upstream of an Id2*PLZF* 
ILC precursor population. Thus, our present knowledge indicates 
that there are at least three populations of ILC precursors with pro- 
gressively limited precursor potential (Fig. 2). A bone-marrow- 
resident precursor that is committed to the ILC2 lineage has also 
been identified**”’, and there is also evidence for the existence of 
precursors of committed ILC1 and ILC3 subsets in mice’*”*™*. Further 
research will be required to refine the precursors and developmental 
checkpoints in the formation of the three ILC subsets, and additional 
analyses of people with primary immunodeficiencies may provide 
more insight into the developmental requirements of human ILCs. 


Transcriptional regulation of development and function 

The transcription factors and related molecules that control the devel- 
opment of ILCs are being identified at a rapid pace, building on our 
knowledge of transcriptional control of T-helper-cell subsets. Two tran- 
scription factors that affect early differentiation of ILCs and NK cells 
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Dysregulated ILC responses can also contribute to chronic inflammatory 
diseases, metabolic disorders and cancer. AHR, aryl hydrocarbon receptor; 
Areg, amphiregulin; GM-CSF, granulocyte macrophage colony-stimulating 
factor; IFNy, interferon-y; IL, interleukin; LTi, lymphoid tissue inducer; NCR, 
natural cytotoxicity receptor; NK, natural killer; TNE tumour necrosis factor; 
TSLP, thymic stromal lymphopoietin. 


have been identified, TOX and NFIL3 (refs 14, 16, 35-38). These tran- 
scription factors do not affect early development of T cells and B cells, 
although TOX is important at later stages of T-cell development for 
the differentiation of CD4* T cells”. Id2 is required for the develop- 
ment of ILCs at an early stage, as Id2-deficient mice lack ILCs and NK 
cells*”’*?*°, One pathway by which Id2 may allow ILC development 
to proceed is through sequestration of the basic helix-loop-helix 
transcription factor E47, which was shown to block development 
of LTi and NK cells”, although it has yet to be determined whether 
this pathway also functions in the development of ILC1s and ILC2s. 
It should be noted that although Id2 is required for optimal develop- 
ment of NK cells, this factor seems to act at a later developmental 
stage in the transition of pre-NK to immature NK cells”. 

TCF-1 and GATA3 also seem to be key transcription factors that 
drive the development of all CD127* ILCs”. Genetic ablation of 
GATA3 in haematopoietic stem cells (HSCs) resulted in the inhibi- 
tion of all ILC subsets and T cells but not of NK cells or B cells'”. 
Deletion of GATA3 in mature ILCs affected ILC2, but not ILC3 
populations'******, suggesting that GATA3 is crucial for the post- 
development maintenance and survival of ILC2s. Other recent data 
indicate that multiple transcription factors regulate the development 
and function of ILC subsets. For instance, the transcription factor 
GFI1 might contribute to the function of GATA3 in ILC2s; this factor 
targets the Gata3 gene, and levels of GATA3 expression were reduced 
in ILC2s deficient in GFI1 (ref. 47), providing a potential mechanism 
through which GATA3 expression is maintained in mature ILC2s. 
These findings indicate that the influence of GATA3 and associated 
transcriptional regulatory networks are dependent on the develop- 
mental stage of the ILC precursors, similar to their role in the devel- 
opment of T cells**. 

The dependence of NK cells, ILC1s and ILC3s on shared transcrip- 
tion factors for their development and function further emphasizes 
the remarkable functional similarity of ILCs with corresponding 
T-helper-cell subsets. For example, eomesodermin and T-bet have 
both redundant and non-redundant roles in the development, func- 
tion and migration of NK and CD8* T cells’, whereas the master 
transcription factor of T,;1 cells, T-bet™, is important for the devel- 
opment and IFNy-producing function of non-cytotoxic ILC1s. It 
should also be noted that T-bet is required for upregulation of NKp46 
expression and IFNy production by CCR6 ILC3s’**"**, RORyt is 
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indispensable for the development of all ILC3 subsets’*”*””, as it is 


for T,,17 cells™. In addition, the aryl hydrocarbon receptor (AHR), a 
ligand-activated transcription factor that acts as a sensor for multiple 
exogenous and endogenous compounds including toxins, tryptophan 
metabolites, dietary products and bacterial pigments, controls the 
survival and function of ILC3 subsets*”* and Ty17 cells*”. Finally, 
Notch signalling, which is required for T-cell development, has also 
been implicated in the development of ILC3s**”*”* and ILC2s****. It 
is important to note that there are also transcription factors that act 
on ILC subsets but not on the corresponding T-helper cell subset. For 
example, RORa is important for ILC2s but does not seem to play a 
T-cell intrinsic part in T,,2 cell development and function”. 

As with T-helper-cell subsets, there is some evidence that ILCs 
can show functional plasticity in response to environmental cues. In 
mouse models it was shown that the function of ILC3s can be affected 
by a gradient of expression of the transcription factors RORyt and 
T-bet*’. Following activation by cytokines such as IL-12 and IL-18, 
ILC3s exhibit increased expression of T-bet and decreased expres- 
sion of RORyt, resulting in increased IFNy production and loss of 
their capacity to produce IL-17 and IL-22 (refs 17, 51). These murine 
cells, termed ex-RORyt* ILC3s, demonstrate ILC1-like function. In 
humans, an apparently similar switch in ILC3 to ILC1 effector func- 
tion has been observed'**’. Whether human or mouse ILC2s exhibit 
plasticity in their effector functions and what the functional signifi- 
cance of this might be remains unclear at present. 


Heterogeneous functions of ILCs 
Emerging studies have identified diverse functions for ILCs, including 
promoting host defence against infection and regulating interactions 
with the microbiota. In addition, ILCs can orchestrate wound healing 
and tissue repair, whereas in other circumstances they can promote 
inflammation and tumour progression. 


ILCs promote immunity to infection 

ILCs are enriched at barrier surfaces that are common sites of colo- 
nization or invasion by pathogens. It is now clear that all ILC subsets 
can have an important role in innate immune responses to viruses, 
bacteria, fungi and both intracellular and extracellular parasites at 
these barrier surfaces. ILC recruitment to barrier tissues takes place 
during embryonic development and further migration of ILCs is 
likely to occur in the context of ongoing inflammation. Constitu- 
tively present within adult tissues, ILCs are poised for rapid activa- 
tion by host-derived cytokines and growth factors* *. Epithelial cells 
and myeloid cell lineages act cooperatively to sense infection and/ 
or tissue damage and produce cytokines and alarmins that mobi- 
lize the rapid recruitment of distinct ILC populations’. For example, 
IL-12, IL-15 and IL-18 activate NK cells and ILC1s!*"*!°, whereas IL-2 
(refs 60, 61), IL-4 (refs 62, 63), IL-25, IL-33 (refs 8—10,19, 64-67), 
thymic stromal lymphopoietin (TSLP)**™, IL-9 (refs 20, 69) and 
TLIA”” activate ILC2s. By contrast, IL-1 and IL-23 stimulate ILC3 
responses” (Fig. 1). 

The role of NK cells in killing virus-infected cells through gran- 
zyme and/or perforin-mediated cytolysis is well established and has 
been reviewed elsewhere”’. In the context of immunity to intracellular 
bacteria and parasites, there are important roles for ILC1s and ILC3s 
in host defence. For example, IFNy-producing ILC1s contribute to 
resistance to Salmonella enterica subsp. enterica serovar Typhimu- 
rium and Toxoplasma gondii infection in the intestine’*”’ (Fig. 3). 
In addition, before the development of adaptive immune responses, 
innate immunity to the extracellular Gram-negative bacterium Cit- 
robacter rodentium is critically dependent on ILC3-derived IL-22 
(refs 23, 74), which has an important role in promoting STAT3- 
dependent expression of antimicrobial peptides and maintenance 
of intestinal epithelial barrier function (Fig. 3)’*”°. Mice deficient 
in IL-22 exhibited exaggerated intestinal inflammation and ero- 
sion of the epithelial barrier and rapidly succumbed to infection”. 
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Figure 2 | Model of developmental pathways of innate lymphoid cells and 
conventional natural killer cells. Like all lymphocytes, innate lymphoid 

cells (ILCs) are derived from a common lymphoid progenitor (CLP). The 
common ILC or natural killer (NK) cell precursor is enclosed in a population 

of cells that have the same phenotype as the CLP but that also express the «487 
integrin. Downstream of the common ILC/NK cell precursor (preNK) is an 
Id2-expressing precursor that can give rise to all ILCs, including lymphoid tissue 
inducer (LTi) cells and an Id2*PLZF* precursor that is restricted to ILCs but is 
unable to develop into LTi cells. In mice, RORyt* ILC3s require T-bet to develop 
into interferon-y (IFNy)-producing cells. Cytokines that upregulate T-bet such 
as interleukin (IL)-12 and IL-18 lead to downregulation of RORyt, abrogation 
of IL-22 and increase of IFNy-producing capacity. These IFNy-producing ILC1- 
like cells are also called ex-RORyt* ILC3s. AHR, aryl hydrocarbon receptor. 


ILC3-derived IL-22 also works cooperatively with lymphotoxin to 
induce fucosylation of intestinal epithelial cells, contributing to 
resistance to S. Typhimurium infection”*. In addition, ILC3s play an 
important part in regulating host-commensal-bacteria relationships 
(see ‘Interactions of ILCs with the microbiota). The host protective 
effects of ILC3s are not limited to bacterial infection in the intestine. 
ILC3s have also been implicated as an important source of IL-22 in 
the lungs following infection with the fungus Candida albicans” or 
the bacterium Streptococcus pneumoniae’’, and an innate source of 
IFNy and IL-17 that resembles ILC3s in the lungs of mice subjected 
to a bacillus Calmette-Guérin (BCG) vaccination protocol pro- 
vided enhanced protection against challenge with Mycobacterium 
tuberculosis”. 

Whereas ILC1s and ILC3s promote innate immunity to viruses, 
intracellular bacteria and parasites, and fungi (Fig. 3), ILC2s are 
essential to the promotion of type 2 inflammation required for 
immunity to some extracellular helminth parasites’**. The type 2 
inflammatory response is characterized by production of cytokines, 
including IL-4, IL-5, IL-9 and IL-13 that regulate the alternative acti- 
vation of macrophages, granulocyte responses, goblet cell hyperplasia 
and smooth muscle contractility that promote parasite expulsion and 
associated tissue-repair processes’. In studies of the mouse-adapted 
intestinal nematode parasite Nippostrongylus brasiliensis, ILC2s were 
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Figure 3 | Host-protective effector functions of innate lymphoid cells 

at barrier surfaces. In addition to their functions in lymphoid tissue 
development and metabolic homeostasis, innate lymphoid cells (ILCs) can 
orchestrate multiple antimicrobial effector functions at barrier surfaces 

in the context of exposure to viruses, bacteria, protozoa and helminths. 
Both natural killer (NK) cells and ILC1s produce interferon-y (IFNy) and 
contribute to protective immunity against viruses, intracellular bacteria and 
protozoan parasites. Production of type-2 cytokines, including interleukin 
(IL)-4, IL-5, IL-9 and IL-13 promote alternative activation of macrophages, 
eosinophilia, goblet-cell hyperplasia and smooth-muscle contractility that 
contribute to expulsion of helminth parasites. ILC3s produce IL-17A, IL-22, 
lymphotoxin (LT) and tumour necrosis factor (TNF) and contribute to 
control of extracellular bacterial infection. 


identified as the dominant non-T-cell source of IL-13, which is cru- 
cial for the expulsion of N. brasiliensis*"'°. 1LC2-deficient mice failed 
to efficiently expel their parasites, but adoptive transfer of wild-type 
ILC2s into ILC- or IL-13-deficient mice was sufficient to restore effi- 
cient worm expulsion, suggesting that these cells are crucial for the 
development of protective immune responses to N. brasiliensis in 
the intestine**. 

During N. brasiliensis infection, IL-25 and IL-33 (refs 8-10), the 
signalling adaptor molecule Act] (ref. 80) and the transcription fac- 
tors GATA3 (ref. 81), TCF-1 (ref. 42) and GFI1 (ref. 47) have been 
shown to be essential for the population expansion and IL-13 expres- 
sion by ILC2s that contribute to worm expulsion. These data sug- 
gest that a complex network of regulatory factors promote optimal 
ILC2 responses that directly contribute to protective immunity to 
helminths. In addition, recent work has shown that ILC2s interact 
with adaptive immune cells to indirectly promote protective type 2 
immune responses. For example, ILC2s express major histocompat- 
ibility complex (MHC) class II and can activate T cells (albeit less 
efficiently than dendritic cells) to induce IL-2, which in turn elicits 
ILC2 proliferation and production of T,,2-associated cytokines that 
promote worm expulsion’. Thus, ILC2s participate in both innate 
and adaptive immune responses to directly and indirectly facilitate 
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expulsion of N. brasiliensis. There is evidence that ILC2 popula- 
tions also expand and contribute to protective immunity following 
infection with other helminth species, although the role of ILC2s 
in expulsion of these helminths remains less well defined. ILC2s 
can express amphiregulin”, which is required for immunity to the 
gastrointestinal nematode parasite Trichuris muris®, and IL-33 pro- 
motes the expansion of ILC2-like cells in the lungs of mice infected 
with the helminth Strongyloides venezuelensis®. Collectively, these 
studies provide the foundation for further investigation required to 
comprehensively define the mechanistic role of ILC2s in immune 
responses that coordinate protective immunity to diverse species of 
helminth parasites. 


Interactions of ILCs with the microbiota 

In contrast to their role in promoting antimicrobial responses to patho- 
gens, ILCs also regulate interactions between the host and the diverse 
array of commensal bacterial species that constitute the microbiota. 
ILCs do so by regulating non-haematopoietic and haematopoietic cell 
functions to limit inappropriate immune responses to commensal 
bacteria**. Although the accumulation of most ILC populations in the 
murine intestinal tissues and gut-associated lymphoid tissues can occur 
independently of colonization by the microbiota’**** **, ILC3s seem to 
have a crucial role in the anatomical containment of lymphoid tissue- 
resident commensal bacteria**. For example, in the intestine, loss of 
ILC3s was associated with reduced expression of IL-22 and lower levels 
of antimicrobial peptides expressed by intestinal epithelial cells. These 
effects were coincident with the dissemination of bacteria that belong 
to the genus Alcaligenes and the development of low-grade systemic 
inflammation. Intestinal ILC3s also interact with various immune 
cells to prevent dissemination of commensal bacteria or limit inappro- 
priate immune responses to them. For example, ILC3s can activate B 
cells through membrane lymphotoxin a1 62 and induce the production 
of immunoglobulin A (IgA) that subsequently regulates dendritic cell 
activity and promotes immunological exclusion of commensal bacte- 
ria*’. These data suggest that ILC3-dependent maintenance of intesti- 
nal epithelial and immune-cell responses limits the dissemination of 
select commensal bacterial species and maintains appropriate separa- 
tion between inflammatory microbial-derived products and the host 
immune system. 

ILC3s can also promote an immunologically tolerogenic state in 
the intestine that limits the magnitude of potentially damaging T-cell 
responses against commensal bacteria. ILC3s express MHC class I and 
are able to process and present antigens to T cells, although they lack 
expression of the co-stimulatory molecules required to activate T-cell 
responses”. In this context, depletion of ILC3s or selective deletion of 
MHC class II on ILC3s was associated with exaggerated commensal- 
bacteria-specific T,,17 cell responses and, in some circumstances, the 
development of intestinal inflammation, indicating that intestinal ILC3s 
dampen commensal bacteria-specific T-cell responses in an MHC- 
class-II-dependent manner”***”. The antigen-presenting capacity of 
ILC3s may depend on the tissue and activation state of the cell because, 
in contrast to intestinal ILC3s, spleen-derived ILC3s that are cultured 
on feeder cells ex vivo can express co-stimulatory molecules and prime 
CD4'T cells in vitro”. In addition, T cells can regulate the population 
size and function of ILC3s in the intestine”’, suggesting that dynamic 
cross-regulation exists between T cells and ILC3s in this tissue site. Pro- 
duction of GM-CSF is another mechanism by which ILC3s can promote 
a tolerogenic state. A recent study demonstrated that cues from the 
intestinal microbiota elicit GM-CSF from ILC3s to promote intestinal 
homeostasis through enhancing dendritic cell and regulatory T-cell 
function”. Human and murine ILC3s can regulate the proliferation of 
B cells, particularly of marginal zone B cells”, that could also contribute 
to immunological exclusion and anatomical containment of commensal 
bacteria. Collectively, the effects of ILC3s on the epithelium, dendritic 
cells, T cells and B cells supports the capacity of the mammalian host to 
tolerate colonization by the microbiota. 
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ILC-mediated tissue remodelling, healing and repair 

ILCs also contribute to the maintenance of tissue homeostasis by con- 
tributing to tissue remodelling, wound healing and repair processes 
at multiple tissues sites**. LTi cells, a subset of ILC3s, have long been 
recognized for their ability to promote tissue modelling in the fetus 
and after birth®”"’. During embryonic development, LTi cells promote 
the formation of secondary lymphoid organs such as Peyer’s patches in 
the gut'’. LTi cells produce LTa12 that binds to LTBR on stromal cells, 
resulting in stromal cell secretion of chemokines (CXCL13, CCL21 and 
CCL19) and upregulation of adhesion molecules (VCAM1, MadCam1 
and ICAM1) that attract and bind leukocytes to ultimately form lym- 
phoid structures”. After birth, LTi cells use the LTal2-LTBR pathway 
to aid in the formation of isolated lymphoid follicles, structures that are 
important for immune reactions in the gut”. 

Other tissue remodelling functions of ILCs include wound healing 
and repair of damaged tissues. ILC3s have been implicated in repair 
of lymphoid tissues damaged following acute viral infection and an 
ensuing cytotoxic T-cell response against the virus-infected lymph 
node stromal cells”. IL-22-producing ILC3s also promote tissue 
repair and regeneration in the inflamed intestine” and in radiation- 
damaged thymic tissue”. Furthermore, they can limit allergic airway 
hyper-responsiveness in the lung”, indicating that ILC3s participate 
in the maintenance of homeostasis in multiple tissues following 
inflammation or damage. In addition, ILC2s express amphiregulin, 
a member of the epidermal growth factor family, and can restore 
bronchial epithelium that has been damaged by infection with influ- 
enza virus’ (Fig. 4). Together, these studies suggest that ILC2s and 
ILC3s contribute to the diverse processes of tissue remodelling that 
promote tissue repair and homeostasis. 


ILCs and chronic inflammatory diseases 

In addition to their ability to promote tissue homeostasis, ILCs can 
also promote inflammation at mucosal and barrier surfaces. In this 
context, chronic ILC activation can contribute to pathology in a wide 
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Figure 4 | Pro-inflammatory and tissue reparative functions of innate 
lymphoid cells. Innate lymphoid cells (ILCs) are amplified in a variety of 
inflammatory diseases that affect barrier functions, suggesting that they 
contribute to pathology. In the skin disorder atopic dermatitis, ILC2s increase 
in numbers. NKp44* ILC3s (NCR’) are amplified in skin lesions of people with 
psoriasis and produce interleukin (IL)-22, possibly contributing to acanthosis 
(skin thickening), which is characteristic for this disease. IL-17-producing 
ILCs may increase skin inflammation. Interferon-y (IFNy)- and IL-17- 
producing ILCs may contribute to inflammatory bowel disease in mice, and 
IFNy-producing ILC] are strongly amplified in inflamed intestinal tissues of 
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range of inflammatory disorders**. In mice, intra-epithelial ILC1s 
and IFNy-producing ILC3s can induce inflammation, and block- 
ing IFNy could ameliorate this disease in some models of colitis'*”. 
IFNy-producing ILCs may also be involved in human inflamma- 
tory bowel disease because the population of ILC1s expands and IL- 
22-producing ILC3s decrease in inflamed intestinal tissues of patients 
with Crohn’s disease’*'®. IL-17-producing ILC3s have been shown 
to play a part in inflammatory bowel disease in T-cell-independent 
mouse models™”®. In one of these models, loss of T-bet resulted in a 
substantial population expansion of IL-17-producing ILCs, driven by 
TNE and IL-23 produced by dendritic cells’®. In this context, CD127 
blockade reduced intestinal ILC3 numbers and attenuated disease”. 
Together, these findings indicate that ILC1s and ILC3s can contribute 
to the development of inflammation in the intestine (Fig. 4). 

ILC2s have been shown to be detrimental in a large variety of 
inflammatory disorders in experimental animals**. In mice with 
allergic lung inflammation, ILC2 populations expand in response to 
cytokines and alarmins, including IL-25, TSLP, IL-33, eicosanoids 
and TLIA produced by type 2 pneumocytes, alveolar macrophages 
and granulocytes; ILC2s also contribute to type 2 inflammation in 
the lung through production of IL-4, IL-5, IL-9 and IL-13 (refs 47, 70, 
71, 99-101). Recent work has also suggested that ILC2s coordinate 
with dendritic cells and CD4" T cells at mucosal sites to shape T-cell 
responses that contribute to allergic airway inflammation’. It is 
unclear to what extent ILC2s are involved in human lung inflam- 
mation, but it is striking that ILC2s are increased in the peripheral 
blood of people with asthma relative to those without the condi- 
tion’, and several genes associated with ILC2s are enriched in people 
with asthma, including the gene that encodes ILIRL1 (a component 
of the IL-33 receptor), RORa and IL-13, suggesting a role for ILC2s 
in disease pathogenesis’. Human ILC2s are robustly expanded in 
another type 2 inflammatory disease, chronic rhinosinusitis, which 
is characterized by pronounced eosinophilia and high IgE levels in 
serum'***. Epithelial cells of resected nasal polyps are capable of 
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patients with Crohn's disease. ILC3-derived IL-22 promotes proliferation of 
tumour cells in a mouse model of infection-induced colorectal cancer. IL-22 
binding protein (BP) secreted by dendritic cells can counteract the effect of 
IL-22. ILC2s cause airway hypersensitivity in a variety of mouse models of 
allergic asthma. Airway epithelial cells triggered by allergens produce thymic 
stromal lymphopoietin (TSLP), IL-25 and IL-33, which activate ILC2s to 
produce IL-5, IL-4, IL-9 and IL-13 that lead to airway hyper-reactivity. In one 
mouse model, ILC3s have been shown to dampen ILC2 hyper-reactivity. Lung 
ILC2s can mediate repair of tissue damaged by a virus through production of 
amphiregulin (Areg). 
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producing large amounts of TSLP and IL-33 that may lead to the 
enhanced IL-5 and IL-13 production by ILCs observed in patients 
with this disease'***, 

Another prominent type 2 disease is atopic dermatitis, an inflam- 
matory skin condition characterized by the presence of T},2 cells 
and high levels of IL-5 and IL-13 in the skin. Recent work has sug- 
gested that ILC2s might be involved in the pathogenesis of atopic 
dermatitis, because the ILC2 population is expanded in the lesions 
of patients with atopic dermatitis’. These cells are stimulated 
by TSLP® and are also likely to be stimulated by IL-25 and IL-33, 
because increased transcripts of these cytokines have been detected 
in atopic dermatitis skin lesions®*””*. Notably, the lectin inhibitory 
receptor KLRG1 may have a regulatory role in the control of ILC2 
activation during atopic dermatitis, because the KLRG1 ligand epi- 
thelial cadherin (E-cadherin) diminishes ILC2 cytokine production, 
and E-cadherin is downregulated in atopic dermatitis lesions”. Stud- 
ies in mouse models of atopic dermatitis support the idea that ILC2s 
can play a T-cell-independent part in establishing skin lesions™. In 
addition, ILC2s can interact with other innate immune cell types 
such as mast cells and basophils to promote type 2 inflammation in 
the skin®! (Fig. 4). 

Similar to the role for ILC2s in regulating type 2 inflammation in 
the skin, IL-17- and IL-22-producing ILC3s have been associated 
with the inflammatory skin disease psoriasis vulgaris. ILC2s and 
NKp44 ILC3s but not NKp44* ILC3s are present in the peripheral 
blood and skin of healthy people, whereas the blood and inflamed 
skin of those with psoriasis contained NKp44* ILC3s, indicating that 
dynamic changes in ILC population structure are associated with 
psoriatic inflammation’”. In support of this idea, one patient who 
showed a reduction of psoriatic plaques in response to treatment 
with anti-TNF monoclonal antibodies also showed a decrease in cir- 
culating NKp44* ILC3s’”. Studies in mouse models of psoriasis also 
suggest that ILC3s contribute to this disease, because mice treated 
with cream containing imiquimod (a Toll-like receptor 7 agonist that 
elicits a psoriasis-like skin inflammation) had IL-17A-, IL-17F- and 
IL-22-producing ILC3s and y6 T cells in the skin’. 

The involvement of ILCs in various inflammatory and autoim- 
mune disorders, as already discussed, has raised considerable inter- 
est in developing strategies to modify ILC functions to treat these 
diseases. Antibodies against ILC2-activating cytokines or monoclo- 
nal antibodies targeting the cytokines produced by ILC2s, including 
IL-5 and IL- 13, are in clinical trials for the treatment of allergic dis- 
eases such as asthma and chronic rhinosinusitus’”. ILC3 functions 
have also recently been targeted in the context of multiple sclerosis. 
Patients with multiple sclerosis had increased frequencies of ILC3s, 
and treatment with an anti-IL-2R antibody resulted in alleviation 
of inflammation and was associated with a reduction of ILC3s in 
peripheral blood""”. Although this study suggests that ILC3s have a 
role in promoting multiple sclerosis, ILC3s seem to be dispensable 
in a murine model of experimental autoimmune encephalomyeli- 
tis''’. In addition, small molecular compounds (SMCs) that target 
ILCs could also potentially be used to modify ILC functions to treat 
disease. In particular, arachidonic acid metabolites that regulate 
ILC2 function have attracted interest. Signalling by leukotriene D, 
to ILC2s that express the cysteinyl leukotriene receptor 1 (CysLT1R) 
results in secretion of type 2 cytokines, which can be inhibited by 
montelukast, a leukotriene receptor antagonist'”'. Prostaglandin D, 
(PGD,; which is produced by mast cells) binds to one ofits receptors, 
CRTH2, on human ILC2s"* to promote migration and cytokine pro- 
duction, which can be blocked by CRTH2 antagonists'"”'””. Notably, 
mast cells, which are major producers of PGD,, stably interact with 
ILC2s, and the resulting dialogue might induce IL-13 production’. 
Finally, lipoxin A,, an anti-inflammatory metabolite of arachidonic 
acid, binds to ILC2s and inhibits IL-13 production’. SMCs that 
may target ILC3 activities include antagonists of RORyt, which 
were shown to affect biological activities of RORyt-dependent T,17 
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cells’. Although the action of these various monoclonal antibodies 


or SMCs might go beyond ILCs, these data suggest that targeting ILC 
functions might be clinically efficacious in the context of multiple 
disease states. 


ILC2s and metabolic homeostasis 

Emerging data highlighting an association between altered ILC 
responses and obesity, malnutrition and metabolic homeostasis sug- 
gest that ILCs may be crucial responders to nutrient and metabolic 
stress. For example, a population of IL-5- and IL-13-producing ILC2s 
in murine white adipose tissue’’*'’” maintains eosinophil and alterna- 
tively activated macrophage responses that limit high-fat-diet-induced 
obesity and insulin resistance''*. Consistent with this, IL-25-elicited 
ILC2 responses were associated with lower weight gain following 
exposure to a high fat diet’, suggesting that ILC2 responses may 
play an important part in regulating adipocyte development and/or 
function in the context of obesity. Before the discovery of ILCs another 
report demonstrated that delivery of IL-33 to genetically obese mice 
could reduce adiposity and improve insulin tolerance, whereas dele- 
tion of the IL-33 receptor was associated with exaggerated high-fat- 
diet-induced obesity’, supporting a potential role for IL-33 and 
ILC2s in metabolic homeostasis. 

In addition to regulating metabolic homeostasis in white adipose tis- 
sue, ILC2s seem to be poised to rapidly respond to changes in nutrient 
status. Effector cytokine production by ILC2s in the small intestine was 
suppressed following fasting, and is regulated by the circadian clock 
through vasoactive intestinal peptide'’’”. Furthermore, in the setting 
of malnutrition elicited by vitamin A deficiency in mice, an imbalance 
in ILC2 and ILC3 responses was evident, resulting in defective ILC3- 
dependent antibacterial immunity but enhanced ILC2-dependent anti- 
helminth immunity’. Together, these data provoke a model whereby 
alterations in nutritional status directly influence ILC-dependent 
maintenance of tissue homeostasis and host defence against infection. 
However, the mechanisms through which ILCs recognize metabolic 
or dietary stress and how they control metabolic homeostasis remain 
poorly defined. 


ILCs can promote tumour development 

There is ample evidence that NK cells are involved in protection 
against cancer in humans and experimental animals. NK cells are 
particularly efficacious in killing tumour cells that have lost class I 
MHC antigens, a process that could be exploited for cancer therapy’. 
However, studies that investigate the involvement of CD127* ILCs in 
tumour immunity are limited. One study has shown that CCR7*CD4* 
ILC3s promote tumour outgrowth of melanoma cells that were engi- 
neered to express the chemokine CCL21, which was associated with 
the development of a suppressive tumour microenvironment”. In 
this context, whether ILC3s can directly tolerize T cells, as has been 
observed in the gut’, remains to be established. 

Other evidence ofa role for ILCs in tumorigenesis comes from stud- 
ies investigating the pro-carcinogenic roles of cytokines involved in 
the activation and effector functions of ILCs, including IL-23 and 
IL-22. IL-23 receptor expression is increased in human colorectal 
cancer (CRC), and transgenic expression of IL-23 in mice induced 
adenomatous tumours that originated in the duodenum™. In this 
model, a crucial role for ILC3s in tumorigenesis in lymphocyte- 
deficient mice was identified, although the contribution of adaptive 
cells remains unclear!”’. In other models, a possible role for IL-22 in 
tumorigenesis in the gut has been described. IL-22 is important for 
colonic epithelial cell repair, but this activity must be tightly controlled 
by the soluble IL-22 receptor (IL-22 binding protein, IL-22BP) to pro- 
tect against tumorigenesis, because genetic ablation of IL-22BP led to 
uncontrolled IL-22 production that facilitated tumour development™. 
In addition, in a mouse model for inflammation induced by CRC, 
IL-22- and IL-17-producing RORyt-dependent ILC3s in the colon 
caused inflammation and in the presence of a carcinogen promoted 
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development and growth of CRC’. IL-22 might also be involved in 
human CRC because IL-22-producing tumour-infiltrating lympho- 
cytes, including both T cells and non-T cells, were frequently observed 
in CRC, and IL-22 production in the tumour was significantly higher 
than in non-tumour tissue sections from the same patients". How- 
ever, the exact role of IL-22 in tumour growth and the relative impor- 
tance of IL-22-producing ILC3s and T,22 cells in CRC has yet to 
be determined. 

Together, these studies suggest that ILCs can promote tumour 
growth through production of tumour-promoting cytokines and 
by creating a suppressive tumour microenvironment. However, one 
study has reported an ILC3-mediated regression of a skin-residing 
B16 melanoma cell line that was engineered to express IL-12 (ref. 125). 
These ILC3s produced IFNy and IL-17, but the mechanism of tumour 
rejection induced by ILC3s in this model remains unclear”. Although 
it is possible that tumour-derived IL-12 induced differentiation of 
IL-22-producing ILC3s into protective IFNy-producing ex-RORyt* 
ILC3s, as has been observed in the gut’*””, further studies are required 
to understand the plasticity and function of ILCs within the tumour 
microenvironment. 


Outlook and future directions 

The identification of ILCs and the subsequent recognition of their 
diverse functions in integrating non-haematopoietic and haemat- 
opoietic cell responses have provided new insights into how innate 
immune responses and tissue homeostasis are regulated in health and 
disease and how these innate responses affect the adaptive immune 
system. Although present in tissues in relatively low numbers, the 
selective distribution of ILCs within lymphoid and non-lymphoid 
tissues seems to confer a remarkable ability to regulate multiple 
physiological processes throughout the body. A limitation of many 
ILC-related studies has been the inability to genetically target select 
ILC populations in the presence of the adaptive immune system. 
The development of new genetic tools will address this challenge 
and could also lead to the identification of previously unrecognized 
innate immune cell populations that will allow greater understanding 
of the complexity of the immune cell network. Finally, understand- 
ing how ILC responses are dysregulated in the context of infectious, 
metabolic and chronic autoimmune and inflammatory conditions in 
humans might offer therapeutic potential in the treatment of a wide 
range of debilitating diseases. 

Note added in proof: Two papers recently appeared online while 
the current Review was in press reporting a crucial role for ILC2s 
in promoting the beiging of white fat and the regulation of meta- 
bolic homeostasis (J. R. Brestoff et al. Group 2 innate lymphoid cells 
promote beiging of white adipose tissue and limit obesity. Nature 
http://dx.doi.org/10.1038/nature14115 (2014); M. -W. Lee et al. Acti- 
vated type 2 innate lymphoid cells regulate beige fat biogenesis. Cell 
http://dx.doi.org/10.1016/j.cell.2014.12.011 (2014)). = 
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Nutrient-sensing mechanisms 


and pathways 


Alejo Efeyan’***, William C. Comb’? & David M. Sabatini®?*4° 


The ability to sense and respond to fluctuations in environmental nutrient levels is a requisite for life. Nutrient scarcity is 
aselective pressure that has shaped the evolution of most cellular processes. Different pathways that detect intracellular 
and extracellular levels of sugars, amino acids, lipids and surrogate metabolites are integrated and coordinated at the 
organismal level through hormonal signals. During food abundance, nutrient-sensing pathways engage anabolism and 
storage, whereas scarcity triggers homeostatic mechanisms, such as the mobilization of internal stores through autophagy. 
Nutrient -sensing pathways are commonly deregulated in human metabolic diseases. 


utrients (also referred to as macronutrients) are simple 

organic compounds that are involved in biochemical reac- 

tions that produce energy or are constituents of cellular 
biomass. Glucose and related sugars, amino acids and lipids are 
important cellular nutrients, and distinct mechanisms to sense their 
abundances operate in mammalian cells. Essentiality is not neces- 
sarily a hallmark of nutrients; for certain amino acids, such as argi- 
nine, cysteine, glutamine, glycine, proline and tyrosine, essentiality 
is context dependent. In healthy people, the de novo synthesis of 
these amino acids from other molecules meets organismal require- 
ments, but under particular metabolic needs, such as during the rapid 
growth of infants’, they must also be obtained from the environ- 
ment. Nutrient scarcity has operated as a strong pressure for selecting 
efficient mechanisms of nutrient sensing in all species. Considering 
the importance of nutrient homeostasis for all living organisms, and 
for human health in particular, it is surprising that we know relatively 
little about direct nutrient-sensing mechanisms. 

The sensing of a particular nutrient may involve the direct bind- 
ing of the molecule to its sensor, or occur by an indirect mechanism 
relying on the detection of a surrogate molecule that reflects nutri- 
ent abundance. Regardless of the manner in which nutrient sensing 
occurs, for a protein to be considered a sensor, its affinity must be 
within the range of physiological fluctuations of the concentration 
of the nutrient or its surrogate. 

Unicellular organisms are directly exposed to environmental fluc- 
tuations of nutrients, and sense both intracellular and environmental 
nutrient levels. By contrast, most cells in multicellular eukaryotes 
are not directly exposed to changes in environmental nutrients, and 
homeostatic responses aimed at maintaining circulating nutrient 
levels within a narrow range exist. Nevertheless, internal nutrient 
levels do fluctuate, and hence intracellular and extracellular nutrient- 
sensing mechanisms are also present in mammals. In multicellular 
organisms, nutrients also trigger the release of hormones, which act 
as long-range signals with non-cell-autonomous effects, to facilitate 
the coordination of coherent responses in the organism as a whole. 

In this Review, we discuss intracellular and extracellular glucose-, 
amino-acid- and lipid-sensing mechanisms and signalling events in 
mammals; discuss how these sensing mechanisms become deregu- 
lated in human disease; and describe how internal nutrient stores are 
mobilized during nutrient scarcity. 


Lipid sensing 

Lipids are a large and diverse set of nutrients (for example, fatty acids 
or cholesterol) characterized by hydrophobic carbon backbones that 
are used for energy storage and membrane biosynthesis, among other 
cellular processes. Owing to their non-polar nature, lipids are nor- 
mally either packaged into lipoproteins and chylomicrons or bound 
by albumin in the serum’; they are rarely found free in a soluble form 
in the organism. Despite the morbidity caused by high levels of lipid 
intake and deregulated lipid storage, which occurs in obese states, 
our knowledge of lipid-sensing mechanisms, with some exceptions, 
is quite limited. 


Fatty-acid signalling 

A family of G-protein-coupled receptors, best characterized by 
GPR40 and GPR120, detects long-chain unsaturated fatty acids. In 
mechanisms that are not fully understood, free fatty-acid stimulation 
of GPR40 at the plasma membrane of pancreatic B-cells augments 
glucose-stimulated insulin release* (Fig. 1a). GPR120 also mediates 
insulinotropic activity, albeit by an indirect mechanism, involving 
production of GLP1 in the gut and the release into circulation. GLP1 
belongs to a group of gastrointestinal hormones called incretins that 
promote insulin release in B-cells*. These examples demonstrate 
how an increase in one particular nutrient (fatty acids) anticipates a 
response to the imminent increase in another nutrient (glucose), as 
food intake rarely provides solely one nutrient species. In addition, 
activation of GPR120 at the plasma membrane of white adipocytes 
leads to a signal transduction cascade that promotes phosphati- 
dylinositol-3-OH kinase (PI(3)K) and AKT activation, leading to 
the cell-autonomous induction of glucose uptake (Fig. 1a). Genetic 
mutations that disrupt GPR120 function occur in people who are 
obese, and ablation of Gpr120 in mice contributes to diet-induced 
obesity, suggesting that this signal transduction pathway has a key 
role in the systemic control of nutrient homeostasis’. Naturally, these 
findings have spurred interest in the development of GPR120 ago- 
nists to control the onset of obesity’. 

In addition to GPR120, the CD36 (also known as FAT) recep- 
tor has been implicated in direct binding and uptake of intestinal 
lumen fatty acids’, and, interestingly, GPR40, GPR120 and CD36 
have fatty-acid-sensing properties in cells within the oral epithelium 
that are involved in gustatory perception” (Fig. 1a). 
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Cholesterol sensing 
Our limited knowledge about the sensing of other lipid species is in 
contrast with our profound understanding of the cholesterol-sensing 
mechanism, which was deciphered by Brown and Goldstein”. Sterols, 
including cholesterol, are fundamental constituents of mammalian mem- 
branes that provide membrane fluidity and are needed for the synthesis 
of steroid hormones. Cholesterol can be obtained from the diet, as well as 
synthesized de novo. Hence, adequate sensing of internal cholesterol levels 
allows the energetically demanding cholesterol biosynthetic pathway to 
be controlled, so that it is only active when external supply and internal 
levels of sterols are low. Cholesterol sensing occurs in close proximity to 
the regulation of the cholesterol biosynthetic pathway: the cholesterol- 
sensing protein (cholesterol-sensing protein SREBP | cleavage activating 
protein, SCAP), and the transcription factor that induces the expression 
of enzymes involved in the cholesterol biosynthetic pathway, form a con- 
stitutively bound complex on the endoplasmic reticulum (ER). SCAP 
directly binds cholesterol by a region originally found to span its five 
transmembrane sterol sensing domains (SSDs)’*"*. The initial mapping 
observations were later refined to a loop in the ER lumen side of the mem- 
brane, probably embedded in the lipid bilayer” (Fig. 1b). SCAP is consti- 
tutively bound to sterol regulatory element-binding proteins (SREBPs), 
which transactivate genes that are crucial for cholesterol synthesis. When 
cholesterol levels are high, cholesterol binding to SCAP triggers a con- 
formational change that increases its affinity for the INSIG proteins’®, 
an anchor for SCAP and SREBP within ER membranes. Conversely, 
when cholesterol levels are low and SCAP is not bound to cholesterol, 
the SCAP-SREBP tandem dissociates from INSIG and shuttles to the 
Golgi apparatus” (Fig. 1b). This step is essential because the presence of 
the SCAP-SREBP complex at the Golgi allows the cleavage and release 
of the cytoplasmic amino-terminus of SREBP by proteases that are resi- 
dent at the Golgi". In turn, the cleaved cytoplasmic fragment of SREBP 
translocates to the nucleus and induces genes involved in lipid anabolism. 
Replete cholesterol levels then initiate a slow negative feedback by inter- 
acting with SCAP and inhibiting further cleavage of SREBP”. 
Substantial evidence supports an additional sterol-sensing event that 
occurs within the ER, involving the enzyme HMG-CoA reductase. 
HMG-CoA reductase, a transcriptional target of SREBP, catalyses the 
rate-limiting step in de novo cholesterol synthesis in response to low cho- 
lesterol levels. The carboxy-terminus of HMG-CoA reductase, containing 
its catalytic activity, is exposed to the cytoplasm, whereas several trans- 
membrane domains, including the sterol-sensing domain reminiscent 
of that of SCAP, are embedded in the ER membrane”. High levels of 
intermediate lipid species in cholesterol synthesis, such as lanosterol, trig- 
ger the binding of HMG-CoA reductase to INSIG, which is also bound 
constitutively to an ubiquitination complex formed by VCP, GP78 and 
UBC7. This interaction promotes the ubiquitin-mediated degradation of 
HMG-CoA reductase” (Fig. 1c). As mentioned, HMG-CoA reductase 
catalyses an early (and rate limiting) step in cholesterol synthesis, but the 
levels of HMG-CoA reductase are regulated by a slow, transcriptional 
mechanism that is shut off only after cholesterol levels have been replen- 
ished. Hence, the interaction of HMG-CoA reductase with INSIG, leading 
to its turnover by the proteasome, constitutes a faster regulatory loop that 
aims to put a brake on cholesterol synthesis when the presence of precur- 
sor molecules guarantees its imminent increase. 


Sensors upstream of adipokines 

Adipokines, hormones secreted by adipocytes, exert systemic effects 
that include the regulation of appetite, energy expenditure and other 
processes that contribute to nutrient homeostasis. Their levels do not 
necessarily reflect circulating lipid levels, but relate to organismal 
lipid storage”, and some adipokines, such as leptin, can be consid- 
ered as surrogate indicators of lipid-storage abundance. Surprisingly, 
the identity of the sensor that connects high levels of stored lipids 
with leptin production remains a mystery, despite the identification 
of regulatory elements in the promoter region of the LEP gene”. We 
know considerably more regarding the systemic effects downstream 
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Figure 1 | Lipid-sensing mechanism. a, Fatty-acid (FA) detection 
mechanisms by GPR40 and GPR120 (left) and CD36 (right). GPR family 
members are expressed in several cell types including enteroendocrine cells, 
taste buds and white adipocytes; in the enteroendocrine cells, binding to 

fatty acids occurs on the luminal side, and the signal is transduced through 

a G protein, leading to the release of incretins into the circulation; in taste 
buds, they trigger the release of neurotransmitters; and in white adipocytes, 
activation of GPR120 indirectly promotes glucose uptake. Binding of CD36 to 
free fatty acids in oral taste buds triggers calcium release from the endoplasmic 
reticulum (ER) and neurotransmission; in enterocytes, it directly promotes 
fatty-acid uptake. b, Cholesterol sensing by SREBP1 cleavage-activating 
protein (SCAP). In the presence of cholesterol, the SCAP-SREBP complex 
binds the INSIG proteins at the ER membrane and remains anchored in the 
ER. When cholesterol is absent and SCAP-SREBP does not bind INSIG, the 
complex traffics to the Golgi where the cytoplasmic tail of SREBP gets released 
by proteolytic cleavage, and triggers a cholesterol synthesis transcriptional 
program at the nucleus, including the synthesis of HMG-CoA reductase 
(HMGCR). c, The enzyme HMGCKR catalyses a rate-limiting step in cholesterol 
synthesis, and is synthesized when cholesterol levels are low. HMGCR is 
embedded in the ER membrane and has cytoplasmic domains, which include 
its catalytic activity (right). In the presence of abundant intermediate species 
in the cholesterol biosynthetic pathway (such as lanosterol), HMGCR interacts 
with the INSIG proteins, constitutively bound to an ubiquitination complex. 
This leads to HMGCR ubiquitination and degradation and halts the synthesis 
of cholesterol in a rapid regulatory mechanism, which is key to the anticipation 
of an imminent increase in cholesterol levels. 


of leptin. Leptin receptor (LEPR) is expressed both in the central 
nervous system and in peripheral tissues and its activation coordi- 
nates food intake and organismal metabolism. In hypothalamic neu- 
rons that suppress appetite (anorexigenic neurons), leptin activity 
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Figure 2 | Amino-acid-sensing mechanisms. a,GCN2 detects insufficiencies 
of cellular amino acids (AAs). During low levels of any amino acid, its 

cognate aminoacyl transfer RNA synthetase (aaRS) fails to load the tRNA 
(purple and blue structures). The unloaded tRNA is then detected by GCN2 
kinase, which halts translation initiation. b, Mechanistic target of rapamycin 
complex 1 (mTORC1) is activated downstream of elevated intracellular 

amino acids through its recruitment to the outer lysosomal surface by a Rag 
GTPase-mediated mechanism. The identity of the sensor for amino acids 
remains unclear, and several non-mutually exclusive possibilities exist: an 
intra-lysosomal sensor that transduces the signal through the membrane, a 
lysosomal transmembrane sensor that both detects and transduces the signal, 
and a cytoplasmic sensor that operates downstream of amino-acid export 
from the lysosome. c, Sensing of extra-organismal amino acids by oral taste 
receptors. The heterodimeric receptor T1R1-T1R3 binds amino acids at high 
concentrations only, and triggers a signal transduction cascade through a 

G protein. In the intestinal epithelium, it also leads to the localization of GLUT2 
to the apical membrane, facilitating glucose import. 


antagonizes the effect of appetite-stimulating neuropeptides and 
neurotransmitters. Lipid mobilization by adipocytes, as occurs in 
fasting states, results in decreased leptin production, thereby stim- 
ulating appetite and promoting nutrient acquisition behaviour. 
Indeed, mutations in the LEPR gene were found in people who are 
morbidly obese”, and mice harbouring inactivating mutations in 
Lep* or Lepr” are hyperphagic to the extent that they can be double 
the mass of normal mice. 

In addition to leptin, adipocytes also synthesize the hormone 
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adiponectin (encoded by ADIPOQ)*"', although we have even less of an 
understanding of the regulation of its production”. In contrast to leptin, 
circulating adiponectin levels inversely correlate with lipid storage, and 
this adipokine exerts a multitude of systemic effects that include the pro- 
motion of energy expenditure, insulin sensitivity and loss of appetite”. 
Mutations and polymorphisms in the human ADIPOQ gene strongly 
correlate with obesity and the development of type 2 diabetes”. 


Amino-acid sensing 

Amino acids are the building blocks for proteins, the most abundant 
macromolecules in cells. Protein synthesis is energetically expensive 
and complex; accordingly, cells sense extracellular and intracellular 
amino acids to couple their abundance to use. When amino acids are 
scarce, proteins constitute reservoirs of amino acids that catabolic 
programs, such as proteasome-mediated degradation and autophagy, 
mobilize. Amino acids are subsequently recycled and allocated for 
the synthesis of specific proteins required under nutrient limitation. 
Furthermore, during periods of prolonged starvation and hypogly- 
caemia, amino acids are catabolized for the production of other forms 
of energy, such as glucose and ketone bodies, which are required to 
fuel the particular needs of certain organs (for example, the brain). 
Hence, the accurate sensing of amino-acid levels is key for the effi- 
cient regulation of protein and amino-acid synthesis and catabolism, 
as well as for the control of food intake. 


GCN2 

In protein synthesis, no amino acid compensates for the absence of 
another, therefore, the cell must be able to efficiently detect the lack of 
any amino acid to prevent potential failures in peptide-chain synthe- 
sis. The structural unit of protein-synthesis machinery, the ribosome, 
incorporates amino acids into a nascent peptide by the sequential bind- 
ing of a specific transfer RNA covalently linked to its cognate amino 
acid. Amino-acid-specific aminoacyl tRNA synthetases (aaRSs) execute 
the loading of amino acids to their cognate tRNAs”, and uncharged 
tRNAs accumulate during low levels of free amino acids. Failure to fin- 
ish a peptide chain due toa stalled ribosome under amino-acid scarcity 
is inefficient and energetically onerous, so cells anticipate this situation 
by preventing the initiation of translation. The mechanism involves a 
single protein that is able to detect any uncharged tRNA, regardless of 
its amino-acid specificity, allowing for the detection of low levels of any 
amino acid in the context of an abundance of the other 19 amino acids. 
This protein is general control nonderepressible 2 (GCN2), which has a 
high affinity to all uncharged tRNAs” (Fig. 2a), and represents an elegant 
example of amino-acid sensing by the detection ofa surrogate molecule. 
Under low intracellular amino acid levels, the binding of GCN2 to a given 
uncharged tRNA triggers a conformational change that leads to kinase 
activation and inhibitory phosphorylation of a key early activator of 
translation initiation: eukaryotic translation initiator factor 2 a (elF2a)"". 
Mouse models have proven the importance of GCN2 and eIF2a in mam- 
malian responses to transient drops in amino acids*”” and, interestingly, 
this amino-acid-sensing pathway seems to playa key part in the central 
nervous system for the detection of imbalances in amino-acid composi- 
tion in food, independent of taste. 

Inhibition of protein synthesis by GCN2 and eIF2a occurs in 
concert with other cellular responses to amino-acid depletion, such 
as the inhibition of the mechanistic target of rapamycin (mTOR) 
pathway (see ‘mTORC1’). This restricts translation to those mes- 
senger RNAs encoding proteins required for cellular adaptation to 
nutrient starvation, while impairing synthesis of most other pro- 
teins’. Minimizing translation also enables amino acids to be used 
as energy sources. 


mTORC1 

The mTOR kinase, when part of mTOR complex 1 (mTORC1), controls 
cellular energetics by inducing numerous anabolic processes, including 
protein and lipid synthesis’. Growth factors activate mTORC1 through 
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a well-understood signal transduction cascade initiated by the binding of 
areceptor at the plasma membrane, and culminating in the activation of 
the Rheb GTPase. Rheb directly binds mITORC1 and activates its kinase 
ina growth-factor-dependent manner*. In addition to regulation by 
hormones, intracellular amino acids also activate mT'ORCI1, so the com- 
plex integrates information on both systemic and cellular nutrient levels. 
In spite of the fact that mTORC1 activity is highly responsive to changes 
in amino-acid levels, it is not an amino-acid sensor. Indeed, mMTORC1 
activation is one of the several examples of a key sensing signalling pro- 
cess for which, despite intense interest, the actual nutrient sensors remain 
unidentified (Fig. 2b). mTORC1 is not equally sensitive to all amino acids 
— leucine, for example, is particularly important for its activation”’. We 
can only speculate about the selective importance of leucine levels for 
mTORC1 activation; it is one of the most abundant amino acids in pro- 
teins, and hence, more likely to be limiting during protein synthesis. 
Intriguingly, GCN2-knockout mice fed a leucine-deficient diet have a 
more severe phenotype than the same animals fed diets lacking trypto- 
phan or glycine*’. Thus, leucine seems to be crucial for the organismal 
sensing of amino-acid sufficiency and deprivation by different pathways. 
The molecular characterization of the amino-acid-dependent activation 
of mTORC1 started only a few years ago with the identification of the 
Rag family of GTPases™*», which regulate mTORC1 through a mecha- 
nism distinct to that of growth factors. Whereas growth factors regulate 
the kinase activity of mTORC1, the Rag GTPases recruit MTORC1 to 
the outer lysosomal surface, an essential step in its activation”. Because 
mTORCI kinase activation by Rheb occurs at the outer lysosomal sur- 
face, it is only possible following Rag GTPase-dependent recruitment of 
mTORCI (Fig. 2b). Hence, amino-acid abundance and the consequent 
recruitment of mTORC1 is a prerequisite for the activation of MTORC1 
by growth factors (Fig. 2). Although the sensors for amino acids have not 
been identified so far, a few pieces in the puzzle of amino-acid-dependent 
regulation of mTORCI1 have recently been added. Cell-based biochemi- 
cal studies have identified the proteins responsible for tethering the Rag 
proteins to the lysosomal surface”, guanine exchange factors (GEFs) and 
GTPase-activating proteins (GAPs), as well as other regulatory proteins 
operating upstream of the Rag GTPases”. 

The reason for a lysosomal-centred mechanism of mTORC1 activation 
may be puzzling, but independent pieces of evidence suggest that the 
lysosome has a key role in amino-acid homeostasis. The yeast vacuole, 
an organelle equivalent to the mammalian lysosome, accumulates nutri- 
ents such as amino acids™, and mTORCI recruitment is conserved in 
yeast®. In addition, high intraluminal concentrations of certain amino 
acids have also been shown in lysosomes™. Protists such as Dictyostelium 
discoideum obtain energy through phagocytosis and lysosomal degrada- 
tion”, which is followed by a transient increase in intralysosomal nutrient 
levels. Finally, both the lysosome and the vacuole are the organelles in 
which amino acids and other nutrients are scavenged from cellular com- 
ponents, through the catabolic process of autophagy. Hence, high levels 
of amino acids within the lysosome or vacuole system seem to reflect, 
to some extent, cellular amino-acid abundance, and so it is reasonable 
to couple its sensing with recruitment and activation of MTORC1 —a 
crucial regulator of most anabolic processes, including protein synthesis. 

Germline and sporadic mutations in genes involved in the signal trans- 
duction of nutrient levels upstream of the Rag GTPases have been found 
in human syndromes characterized by growth defects, neurological dis- 
orders, skin and immunological problems, and tumorigenesis", 


Amino-acid-sensing taste receptors 

As strict heterotrophs, mammals must obtain energy and nutrients 
from external organic sources. Predicting the nutritional value 
of food before digestion allows for the accurate selection of food 
sources and for the anticipation of increased nutrient abundance. 
Several mechanisms act synergistically, including experience and 
social rules in humans, but a fundamental nutrient-sensing event 
occurs at the level of the oral taste buds. Nutrient sensing by taste 
receptors is not just a means of sensing extracellular nutrients, it is 
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Figure 3 | Glucose-sensing mechanisms. a, Glucose sensing by the GLUT2 
transporter. Owing to low affinity, this transporter actively imports glucose only 
during high glycaemic states (right). Its bidirectional properties mean it can 

also export glucose from hepatocytes into the circulation under hypoglycaemic 
states if hepatic gluconeogenesis and glycogen breakdown raise the intrahepatic 
glucose levels (left). b, Intracellular glucose sensing by glucokinase (GCK) 

in hepatic and pancreatic cells. GCK has low affinity for glucose, and shunts 
glucose-6-phosphate (G6P) into either glycolysis or glycogen synthesis only 
when glucose is abundant. c, The mechanism of insulin release downstream 

of glucose sensing in pancreatic B-cells. This is a multi-step process that relies 
on glucose phosphorylation by GCK, subsequent ATP production and ATP- 
mediated blockade of potassium channels. This leads to a calcium influx that 
facilitates the release of insulin from vesicles into the bloodstream. d, Extra- 
organismal glucose sensing by oral taste receptors. The dimeric receptors T1R2- 
T1R3 bind at high concentrations of glucose, sucrose, fructose and artificial 
sweeteners only, and trigger a signal transduction cascade through a G protein. 


a mechanism of extra-organismal sensing that allows the interroga- 
tion of prospective food sources. In humans, taste is divided into five 
categories: sweet, umami, bitter, sour and salty, and is generated by 
signals elicited in taste buds, groups of cells in the tongue, palatal 
and oesophageal epithelium. Within these cells, the taste receptors 
are, logically, exposed in the apical membrane oriented towards the 
environment”. 

Taste receptors belong to the T1R and T2R families of G-protein- 
coupled receptors, and are characterized by seven transmembrane 
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domains with an extracellular N-terminus and an intracellular 
C-terminus. (For molecular and genetic information regarding the 
different members of the taste receptor genes see ref. 71.) The T2R 
family is involved in the detection of bitter molecules, a category that 
includes potentially toxic compounds, and two T1R family members 
are responsible for sensing the presence of amino acids (the umami 
taste). Although other taste receptors also exist’"”’, elegant genetic 
studies using heterologous expression experiments showed that 
the T1R1-T1R3 heterodimer senses amino acids (Fig. 2c). Human 
amino-acid taste receptors have a particularly high affinity to gluta- 
mate, but other L-amino acids also serve as ligands, whereas D-amino 
acids do not”’. Amino-acid binding to a taste receptor triggers signal 
transduction through the plasma membrane, followed by G-protein 
activation and neurotransmitter release”, which is then integrated 
with other neurotransmission events at the level of the central nerv- 
ous system. 

In addition to the presence of taste buds in the oral epithelium, 
taste receptors also exist in endocrine cells in certain regions of the 
gut”’. Intestinal taste receptors operate through G-protein activa- 
tion ina similar manner to that of the oral epithelium, but instead of 
inducing the release of a neurotransmitter that activates an afferent 
signal to the brain, the cascade elicited by enteral taste receptors cul- 
minates in the release of incretins into the blood circulation, serving 
as an anticipatory signal for the imminent digestion of, and systemic 
increase in, nutrients. 

Interestingly, extracellular amino-acid sensing at the plasma mem- 
brane by taste receptors can modulate mTORC1 activation without 


affecting intracellular amino-acid levels”’, a meaningful cross-talk 
that engages the anabolic machinery of the cell in anticipation of an 
elevation in intracellular amino-acid levels, following import. 


Glucose sensing 

Mammals rely on multiple ways of maintaining glucose levels within 
a narrow physiological range. Glucose intake, storage, mobilization 
and breakdown are tightly regulated at different levels, and multiple 
mechanisms of glucose sensing coexist: extra-organismal, extracel- 
lular and intracellular. In addition, a network of hormone signals, 
exemplified by insulin and glucagon, aim to coordinate coherent 
responses to systemic glucose levels in distant organs. Deregulated 
glucose homeostasis mechanisms, from glucose sensing to import, 
storage and mobilization underlie the pathogenesis of human dis- 
eases such as type 2 diabetes. 


Glucokinase 

Glucokinase (GCK) catalyses the first step in the storage and con- 
sumption of glucose, glycogen synthesis and glycolysis, and its 
function constitutes a simple, direct intracellular nutrient-sensing 
mechanism that controls systemic glucose homeostasis. Like all 
hexokinases, GCK phosphorylates glucose to make glucose-6-phos- 
phate (G6P), but unlike the other isozymes, only GCK functions as 
a glucose sensor’’. This uniqueness occurs because unlike the other 
hexokinases, which have K,, values (an inverse measure of affinity) 
for glucose much below the minimum physiological level of glucose, 
GCK has a significantly lower affinity and is only active when glucose 
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Figure 4| Nutrients and autophagy. Autophagy serves as an internal 

source of stored nutrients under conditions of nutrient limitation. Two main 
regulatory inputs for autophagy are AMP-activated protein kinase (AMPK) and 
mechanistic target of rapamycin complex 1 (mTORC1). Autophagy initiation 
can be promoted by the activation of ULK1 through AMPK-dependent 
phosphorylation during low ATP:AMP ratios. mTORC1 is activated by growth 
factor-mediated signal transduction at the outer lysosomal surface if cellular 
amino acids and glucose have recruited mTORCI through the action of the Rag 
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GTPases. Activated mTORCI inhibits ULK1 and ATG13 by phosphorylation. 
Hence, low nutrients promote autophagy by the inhibition of mTORC1. 
Autophagy starts with the engulfment of the cellular constituents glycogen, 
lipids from lipid droplets, soluble proteins, ribosomes or organelles in a double- 
membrane structure that then fuses with lysosomes, in which enzymatic 
breakdown occurs. The products of autophagy, basic nutrients (sugars, lipids, 
amino acid and nucleosides), are then exported into the cytoplasm, in which 
they may be used as a source of energy, or re-used for anabolism. 
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levels are relatively high (around 120 mg dl, or 7 mM, or greater). 
Hence, although the other hexokinases function as ‘phosphoryla- 
tion machines’ regardless of the actual glucose levels, GCK is active 
only during glucose abundance, and it controls systemic glucose fate 
through its effects in the liver and pancreas (Fig. 3b). The liver main- 
tains glycaemia through gluconeogenesis and glycogen breakdown 
during periods of systemic glucose scarcity, or by storing glucose in 
the form of glycogen when it is in excess”*. GCK is the most abundant 
hexokinase in the liver and because it is inactive under conditions of 
glucose limitation, it permits export of unphosphorylated glucose 
from the liver in order to supply the energetic demands of the brain 
and muscles. When hepatic glucose levels are high, GCK-mediated 
conversion of glucose to the metabolic intermediate G6P allows it 
to be shunted into glycolysis (for energy production) or glycogen 
synthesis (for storage). 

GCK is also expressed in B-cells (see ‘GLUT2’), and in neurons and 
glial cells in the hypothalamus. Although work remains to be done 
to understand the role of this glucose sensor in the brain, systemic 
effects, such as feeding responses and insulin release, are likely to be 
downstream of hypothalamic GCK activity”. Dozens of germline 
mutations in GCK in people with abnormal glycaemia and diabetes”, 
together with conditional deletion of the murine Gck gene in the liver 
and pancreas”, support the fundamental role of GCK in maintaining 
organismal glucose homeostasis. 


GLUT2 
The glucose transporter GLUT2 (also known as SLC2A2) is a sensor 
of extracellular glucose levels, and like GCK, GLUT2 has a higher 
K,, (20 mM) than other glucose transporters of the same family. The 
K,, for GLUT1 is around 1 mM and that of GLUT4 is about 5 mM 
(ref. 82), so they are close to saturation even during fasting glycaemia 
(around 4 mM). The low affinity of GLUT2, by contrast, allows for 
efficient transport of glucose across the plasma membrane only when 
glycaemia is high, but not under the low concentrations that still satu- 
rate the other transporters. Accordingly, GLUT2 has crucial roles in 
directing organismal glucose handling after feeding. Hepatic glucose 
import mediated by GLUT2 is followed by GCK-dependent phos- 
phorylation for storage and energy production, as already described. 
Importantly, during periods of low glycaemia, hepatic glycogenolysis 
and gluconeogenesis increase intrahepatic glucose levels. Because 
GLUT2 can transport glucose in a bidirectional manner, it exports 
glucose to the circulation (Fig. 3a). Hence, GLUT2-mediated import 
occurs only during transient hyperglycaemic states, and GLUT2- 
mediated export only happens when intrahepatic glucose levels are 
high, thus constituting a key controller of glucose homeostasis. Not 
surprisingly, inactivating mutations in GLUT2 lead to human meta- 
bolic disorders, such as Fanconi-Bickel syndrome characterized by 
deregulated glycogen accumulation, hepatomegaly and hypoglycae- 
mia, among other symptoms of disrupted glycaemic homeostasis”. 

B-Cells in the pancreas have a specialized role in sensing systemic 
glucose levels, and are responsible for the synthesis and secretion of 
insulin. Glucose is imported in B-cells and phosphorylated by the tan- 
dem of GLUT2 (or GLUT1) and GCK, and, as it is consumed, leads 
to an increased ATP:ADP ratio. This closes potassium channels at the 
plasma membrane, and causes the membrane to depolarize. Dissipa- 
tion of membrane potential results in a transient increase of intracellular 
calcium that facilitates the fusion of insulin-containing vesicles with the 
plasma membrane, releasing its cargo into systemic circulation (Fig. 3c). 
It is important to mention that whereas the predominant transporter in 
murine B-cells is GLUT2, the relative abundance of the GLUT2 trans- 
porter in human islets seems to be minor compared with that of the high 
affinity GLUT 1 transporter — so the relevance of GLUT2 for glucose 
transport in human f-cells is not clear™. 

Elevated sugar intake and chronic hyperglycaemia deregulate 
normal glucose sensing through several mechanisms, including ER 
stress, increased intracellular Ca”* levels, mitochondrial dysfunction, 
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reactive oxygen species and chronic inflammation, all of which con- 
tribute to the corruption of insulin secretion in type 2 diabetes”. 

Finally, although the other glucose transporters (such as GLUT1 
and GLUT4) do not behave as sensors, their activities and effects 
are regulated by different means in order to meet particular require- 
ments of glucose use and storage. GLUT4 is expressed in skeletal 
muscle and adipose tissue, two organs important for post-prandial 
glucose uptake and storage™, and although GLUT has a low K,,, 
glucose uptake in these organs is a regulated process. Insulin triggers 
a PI(3)K-AKT dependent signal transduction cascade that results 
in GLUT4 localization to the plasma membrane, allowing glucose 
uptake in these tissues*®. Because glucose import and storage are insu- 
lin dependent, and thus secondary to direct glucose-sensing mecha- 
nisms in the liver and pancreas, they occur only after the organism 
has reached a threshold of internal glucose abundance. GLUT] is 
expressed in fetal tissues and its constant activity provides glucose to 
all tissues to sustain the rapid growth of the organism. 


AMPK and ATP: AMP ratios 

AMP-activated protein kinase (AMPK) is a fundamental regulator 
of cellular metabolism and coordinates several metabolic responses 
in different cell types. It is exquisitely responsive to cellular energy 
levels; as a surrogate sensing mechanism for glucose abundance, 
increased levels of AMP and ADP directly activate the kinase. AMPK 
has been the subject of a number of excellent reviews addressing its 
activation, regulation, and downstream consequences*’*’, and will be 
briefly discussed here in the context of the regulation of autophagy. 


mTORCI and the sensing of glucose 

The regulation of mTORC1 through Rag-GTPase-mediated recruit- 
ment is not restricted to amino acids; cellular glucose levels also affect 
the activity of the Rag GTPases’. In contrast to the identification of 
some molecular players involved in the activation of the Rag GTPases 
by cellular amino acids, less clear is the mechanism by which glu- 
cose regulates the Rag GTPases. Some aspects downstream of glucose 
and amino-acid sensing are shared, such as the involvement of the 
lysosomal v-ATPase***””*, but additional players remain unidenti- 
fied. Because the amino-acid- and glucose-sensing mechanisms are 
generally independent phenomena, as we illustrate, it is very likely 
that amino-acid and glucose sensing upstream of mTORCI occur 
in parallel and converge upstream of the Rag GTPases, but precisely 
how this integration occurs is unresolved. 


Glucose-sensing taste receptors 

In a similar manner to amino-acid sensing in taste buds by TIR1- 
T1R3, the heterodimer composed of T1R2-T1R3 constitutes the glu- 
cose taste receptor (Fig. 3d). The extracellular N-terminal domains of 
both T1R1 and T1R2 are essential for determining their specificity for 
their natural ligands”’. Millimolar concentration of the saccharides 
glucose, fructose or sucrose activate the T1R2-T1R3 receptor”; this 
concentration may seem high, but sucrose concentration in an apple 
is around 100-200 mM, and so this process is selective and efficient 
for the detection of highly energetic foods. 

T1R2-T1R3 receptors are also expressed in the intestinal epithe- 
lium, and although the sensing process is identical to that of the oral 
epithelium, the signal transduction does not trigger an afferent signal 
to the brain, but results in the transient localization of the GLUT2 
transporter to the apical membrane, leading to increased absorption 
of glucose from the intestinal lumen after feeding”. 

In addition to natural ligands, glucose taste buds also respond to 
artificial sweeteners such as saccharine, cyclamate and aspartame”. 
Activation of glucose taste receptors by artificial ligands has clini- 
cal implications for obesity and type 2 diabetes, as sweeteners may 
increase nutrient absorption and activate other nutrient-sensing 
signalling cascades at different levels, regardless of nutritional 
value. Indeed, some studies have shown that consumers of artificial 
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sweeteners are at higher risk of developing metabolic disease’. The 
phenomenon of artificial activation of this nutrient-sensing pathway 
is currently an active field of research. 


Accessing internal nutrient stores through autophagy 

Because environmental nutrient availability can be intermittent, cells 
and organisms have evolved efficient ways of storing nutrients dur- 
ing periods of abundance. This occurs in unicellular organisms and 
is more obvious and prominent in animals, with the emergence of 
organs specialized in nutrient storage, such as fat tissue, the liver and 
skeletal muscle. Mammalian cells accumulate and store glucose in 
the form of glycogen, lipids within lipid droplets and internal mem- 
branes, and amino acids in proteins and organelles; all of which can 
be mobilized and catabolized to endure periods of nutrient limitation. 
Cells exploit different means to obtain the basic nutrients from inter- 
nal stores, including autophagy, the controlled process of recycling of 
cellular constituents confined within a double membrane structure. 
Autophagy starts with the de novo formation of a membrane struc- 
ture termed the phagophore, which engulfs its cargo and closes as a 
cytoplasmic double-membrane autophagosome. An autophagosome 
then fuses with a lysosome, which leads to the enzymatic breakdown 
of the autophagosomal cargo into its basic building blocks, which are 
then exported from the autophagolysosomes and further catabolized 
to produce energy, or used again in other anabolic reactions (Fig. 4). 

The process of autophagy is unique because it can target any cel- 
lular component and nutrient storage depot, and, as a key internal 
source when nutrients are scarce, is highly regulated at multiple 
levels by nutrients and nutrient signalling”. AMPK, directly acti- 
vated by a low ATP: ADP ratio, phosphorylates and activates ULK1, 
a kinase that regulates autophagy initiation’””*. AMPK also activates 
the FOXO transcription factors, which transactivate the ATG genes 
responsible for the initiation and completion of autophagy”. Hence, 
AMPK acutely regulates autophagy, as well as by means of a slower, 
transcriptional mechanism. 

A crucial regulator of autophagy, as shown in all eukaryotes using 
both cultured cells and model organisms, is mTORC1, through its 
inhibitory phosphorylation of ULK1 and ATG13 (ref. 100). mTORC1 
seems to play a dominant part in the regulation of autophagy, as 
mTORC1 inhibition is sufficient to induce it'”', whereas its constitu- 
tive activation is sufficient to block it®. Nutrient depletion is perhaps 
the most potent inducer of autophagy, and the regulation of mTORC1 
by the Rag GTPases downstream of nutrient scarcity seems to be 
essential for the regulation of autophagy. Mice with constitutive RagA 
activity, and hence, constitutive activation of mTORC1 regardless of 
nutrient levels, develop normally but die within the first day of life, 
similar to mice lacking the essential autophagy genes Atg5 and Atg7 
(refs 89,102,103). Constitutive RagA activity in neonatal mice leads 
to a profound glucose and amino-acid homeostasis defect second- 
ary to an impairment in the detection of nutrient shortage after the 
transplacental supply of nutrients is interrupted at birth. This leads 
to constitutive MTORC1 activity and the consequent inability to trig- 
ger autophagy. 

In addition to the regulation of autophagy initiation, mMTORC1 
activity is required for autophagy termination™. Cellular free amino 
acids, produced by autophagy, result in an increase in mTORCI activ- 
ity and the reformation of lysosomes. Systemic levels of nutrients also 
regulate autophagy through the effects of insulin'®*. The intracellu- 
lar cascade of insulin activates AKT, a positive input for mTORC1, 
and a negative regulator of the FOXO transcription factors. Hence, 
both local and systemic nutrients regulate the process. In addition to 
nutrients, stresses such as hypoxia, ER stress and DNA damage also 
regulate autophagy’. 

Several studies that generated autophagy-deficient tissues in a tem- 
poral specific manner have determined the importance of autophagy 
in mammalian physiology. Besides the aforementioned role of 
autophagy in the early neonatal starvation period’”””’, autophagy is 
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essential for the survival of embryos in the pre-implantation stage'””. 
Whole-body acute deletion of autophagy genes in adult mice eventu- 
ally culminates in neurodegeneration and death, presumably owing 
to the accumulation of harmful organelles and proteins, which 
probably cause neuronal toxicity"*””’. Liver-specific impairment in 
autophagy results in accumulation of abnormal cellular endomem- 
branes, mitochondria and ubiquitinated proteins”, and impaired 
lipid mobilization’. Impaired autophagy seems to preferentially 
affect cells specialized in vesicle trafficking, such as lymphocytes 
and B-cells’, but some of these effects may be due to a deranged 
endomembrane trafficking system, rather than a direct consequence 
of a nutrient homeostasis defect. 


Future directions 

Despite intense research, our understanding of nutrient-sensing 
mechanisms is far from complete. For instance, we have not yet 
deciphered what links lipid storage levels with leptin synthesis and 
release. Equally unclear is what the glucose and amino-acid sensors 
upstream of mTORC1 are. Towards the identification of nutrient sen- 
sors upstream of mTORCI1, the lysosome seems to be a key organelle 
in sensing; however, we still need to determine what is sensed, and 
how, at the lysosome. Besides these and other fundamental unan- 
swered questions of direct nutrient sensing, the mechanisms dis- 
cussed in this Review are outlined mostly in a modular manner. This 
reflects our lack of an integrative view of the nutrient-sensing path- 
ways, and connecting the different aspects of nutrient sensing will be 
one of the challenges of future research. We know that mTORC1 is 
a node at which hormone and nutrient inputs converge, but we still 
do not know whether these signalling cascades cross-talk upstream 
of mTORC1. A complete view of nutrient-sensing mechanisms will 
address potential cross-regulation between different nutrient-sensing 
pathways, but also incorporate regulation by other signalling events. 
For example, we know some of the consequences of chronic inflam- 
mation in deregulating nutrient-sensing mechanisms and the signal- 
ling cascades downstream, such as those that occur in obese states, 
but how exercise modulates nutrient inputs, or how ageing affects 
nutrient-sensing abilities, remain to be determined. From an experi- 
mental point of view, advances in genomics will probably provide 
insight into clinical conditions secondary to deregulated nutrient 
sensing, such as the identification of novel mutations and polymor- 
phisms in humans. Finally, nutrient abundance not only affects the 
onset of diabetes, but also influences cancer development and the 
ageing process. Nutrient sensing and metabolism in cancer cells has 
received a new wave of attention, partly thanks to advances in next- 
generation sequencing and metabolomics. On the one hand, cancer 
cells are exposed to limited nutrients owing to poor vasculature, and 
deregulated proliferation poses energetic and nutrient demands and 
liabilities, which act in concert with aberrant activation of growth 
signals. On the other hand, one of the most successful interventions 
against the onset of ageing is limitation of nutrient intake, or caloric 
restriction’. Hence, understanding normal nutrient-sensing mech- 
anisms is a prerequisite for designing better interventions against 
human disease beyond diabetes. m 
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Necroptosis and its role 


in inflammation 


Manolis Pasparakis' & Peter Vandenabeele*** 


Regulated cell death has essential functions in development and in adult tissue homeostasis. Necroptosis is a newly discovered 
pathway of regulated necrosis that requires the proteins RIPK3 and MLKL and is induced by death receptors, interferons, toll- 
like receptors, intracellular RNA and DNA sensors, and probably other mediators. RIPK1 has important kinase- dependent 
and scaffolding functions that inhibit or trigger necroptosis and apoptosis. Mouse-model studies have revealed important 
functions for necroptosis in inflammation and suggested that it could be implicated in the pathogenesis of many human 
inflammatory diseases. We discuss the mechanisms regulating necroptosis and its potential role in inflammation and disease. 


ell death is intricately connected with life in multicellular organ- 

isms. The balance between cell death, proliferation and differ- 

entiation is crucial for the maintenance of tissue homeostasis 
throughout life. Programmed cell death (PCD) is essential for many 
physiological processes, including the shaping of developing organs, epi- 
thelial cell renewal and lymphocyte selection. However, cell death that is 
not developmentally programmed is a sign of stress, injury or infection 
and is linked to tissue damage and disease pathogenesis. Inflammation is 
areaction of the immune system induced in response to infection or tissue 
injury that is essential for efficient host defence and tissue repair. However, 
uncontrolled excessive and/or prolonged inflammatory responses cause 
tissue damage and contribute to the pathogenesis of acute and chronic 
inflammatory diseases. Although recognized more than 150 years ago as 
acentral component of inflamed tissues, the potential role of cell death as 
an active component that contributes to tissue homeostasis, inflammation 
and disease pathogenesis has only recently gained attention’. Many of the 
receptors involved in inflammation can also induce cell death in addi- 
tion to their potent capacity to drive inflammatory cytokine expression. 
Recent findings suggest that in addition to the transcriptional regulation 
of inflammatory genes, the cell-death-inducing properties of these recep- 
tors also crucially contribute to inflammation. 

For many years apoptosis was considered to be the only form of regu- 
lated cell death (RCD), whereas necrosis was seen as an unregulated acci- 
dental cell death (ACD) process. Genetic, biochemical and functional 
evidence, and the discovery of specific chemical inhibitors of necrosis 
have redefined this process as a molecularly controlled regulated form 
of cell death”. Regulated necrosis includes several cell-death modalities 
such as necroptosis, parthanatos, ferroptosis or oxytosis, mitochondrial 
permeability transition (MPT)-dependent necrosis, pyroptosis and 
pyronecrosis, and cell death associated with the release of (neutrophil) 
extracellular traps, which is described as NETosis or ETosis (Box 1). 
Pyroptosis and necroptosis are the best characterized forms of regulated 
necrosis. Pyroptosis is an inflammatory form of cell death induced by 
inflammasome activation and has important functions in host defence 
and inflammation’. Necroptosis, mediated by receptor interacting protein 
kinase-3 (RIPK3) and its substrate mixed lineage kinase like (MLKL), is 
the best-characterized form of regulated necrosis. Recent studies have 
provided new and exciting insights into the mechanisms controlling 
necroptosis and its in vivo relevance and suggested that necroptosis could 


have important functions in the pathogenesis of several human diseases. 
This Review discusses the mechanisms controlling regulated necrosis 
and its physiological relevance for tissue homeostasis and inflammation, 
focusing particularly on necroptosis. 


Mechanisms and regulation of necroptosis 

Much of our knowledge of necroptosis comes from studies of tumour 
necrosis factor (TNF) signalling. TNF is a pleiotropic cytokine that has 
a key role in inflammation induced by infection or tissue injury. TNF 
signalling, primarily through TNF receptor 1 (TNFR1), induces the 
expression of many genes that regulate inflammation, but under some 
conditions TNF is also a potent inducer of cell death*. Despite its name 
and early evidence that TNF also induces caspase-independent cell death? 
by a mechanism involving RIPK1 (ref. 6), for many years most studies 
of TNF-induced cell death focused on apoptosis. The identification of 
necrostatins as necrosis inhibitors targeting RIPK1 provided evidence that 
TNF-induced necrosis is a kinase-regulated process, and it was dubbed 
necroptosis”*. A key step in unravelling the necroptosis pathway was the 
discovery of RIPK3 as an essential regulator of TNF-induced necrosis”. 
The RIP homotypic interaction motif (RHIM) on RIPK3 and RIPK1 
allows their interaction and is required for necroptosis induction”"®. The 
necrosome was defined as the complex containing RIPK1 and RIPK3 that 
was involved in the initiation of necroptosis’”. RIPK1 and RIPK3 were 
later shown to form large amyloid-like structures”’, although it is unclear 
whether these represent a real signalling platform or a post-event accu- 
mulation of these two interacting kinases. The important physiological 
role of necroptosis was highlighted by a number of genetic studies show- 
ing that caspase-8 or Fas-associated protein with death domain (FADD) 
deficiency cause embryonic lethality and trigger inflammation in vivo by 
sensitizing cells to RIPK3-mediated necroptosis'*. 


Execution of necroptosis 

The identification of MLKL pseudokinase as a substrate of RIPK3 
required for necroptosis sheds light on the mechanisms involved in exe- 
cuting necrotic cell death downstream of RIPK3 (refs 19, 20). Although 
initially MLKL was proposed to be associated with the regulation of mito- 
chondrial fission through the proteins phosphoglycerate mutase fam- 
ily member 5 (PGAMS) and dynamin-related protein 1 (DRP1)”', the 
causality of mitochondrial fission in necroptosis has been challenged”. 
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BOX1 


Regulated necrosis pathways 


Several types of regulated necrosis exist in addition to necroptosis 
(reviewed in ref. 2). Parthanatos involves hyperactivation of poly(ADP- 
ribose) (PAR) polymerase 1 (PARP 1), an enzyme originally characterized 
by its role in DNA-repair mechanisms following a DNA-damage 
response. The massive PARylation of target proteins leads to cellular 
depletion of NAD* (and consequently of ATP), resulting in a bioenergetic 
crisis and a form of regulated necrosis. The term ferroptosis was recently 
coined to describe a type of regulated necrosis that is characterized 

by iron-dependent production of reactive oxygen species (ROS), which 
can be blocked by the iron chelator desferrioxamine (DFO). It is elicited 
by pharmacological inhibition of the antiporter system x, , which 
exchanges extracellular cystine for intracellular glutamate. Glutamate 
toxicity on neurons works by blocking the same antiporter system 

and has been termed oxytosis. Mitochondrial permeability transition 
(MPT)-mediated regulated necrosis is another cell-death modality. 
Cyclophilin D (CYPD) is the sole genetically confirmed component of 
the permeability transition pore complex (PTPC), which is implicated 

in MPT-mediated regulated necrosis. Pyroptosis and pyronecrosis 

are highly inflammatory cell-death modalities characterized by 

cellular swelling and plasma membrane permeabilization. Pyroptosis 
occurs after canonical and non-canonical inflammasome stimulation, 


Table 1 | Inhibitors that interfere with cell-death modalities 


leading to caspase-1 and caspase-11 activation, respectively. Some 
reports distinguish pyronecrosis as a cell-death modality occurring 
during infection that does not depend on caspase-1 or caspase-11, 

but requires cathepsin B release following lysosomal membrane 
permeabilization. NETosis/ETosis (NET is neutrophil extracellular trap; 
ET is extracelluar trap) is also implicated in protection against microbial 
and viral infection. This modality occurs in neutrophils, eosinophils, mast 
cells and macrophages. It is associated with chromatin decondensation 
and release of NETs, which are composed of DNA, chromatin and 
histones, and allow immune cells to immobilize and kill infectious 
agents. Besides apoptosis and the various forms of regulated necrosis, 
a third modality is often put forward: autophagic cell death. The name 
is slightly misleading because autophagy is in the first instance a crucial 
mechanism in cellular homeostasis and adaptive responses, however, 
itcan also become a cell-death mechanism. Autophagic cell death is 
biochemically characterized by markers of autophagy such as lipidation 
of LC3 (microtubule-associated protein 1 light chain 3) and the 
degradation of p62, a ubiquitin-binding scaffold protein, and blocked 
by genetically or pharmacologically targeting the members of the 
autophagy pathway. Inhibitors that interfere with cell death modalities 
are shown in Table 1. 


Apoptosis Regulated necrosis 
Morphology 
Cytoplasmic shrinkage ncreasingly translucent cytoplasm Loss of nuclear 
Chromatin condensation (pyknosis) Swelling of organelles; lysosomal membrane permeabilization integrity 
Nuclear fragmentation (karyorrhexis) Increased cell volume (oncosis) Massive 
Blebbing of the plasma membrane — Permeabilization of the plasma membrane chromatin 
Shedding of apoptotic bodies Mild chromatin condensation; nuclei remain intact decondensation 
Death modality 
Intrinsic apoptosis Extrinsic Necroptosis Ferroptosis MPT-mediated Parthanatos — Pyroptosis NETosis/ETosis 
apoptosis regulated necrosis 
Death regulatory factors 
BID, BAX/BAK RIPK1* — RIPK1* GPX4 CYPD PARP1 nflammasome NOX 
Cytochrome c RIPK3' — RIPK1* Canonical NLR-ASC-CASP 1 
APAF1 FADD RIPK3* on-canonical (sensor/ 
CASP9 CASP8 adaptor?)-CASP11 
Death execution factors 
CASP3 MLKL GSH decrease Ca** increase NAD* increase ROS increase 
CASP7 ion channels Fe** ATP increase 
Lipid peroxidation, energetic catastrophe, and lysosomal and plasma-membrane permeabilization 
Synthetic inhibitor (factor they inhibit) 
zVAD-fmk (CASP) EC1 and NEC1s Fer-1 SfA (CYPD) 3-AB (PARP1) VX-740 (CASP1) DPI (NOX) 
q-VD-Oph (CASP) GSK Cpd27 (RIPK1) DFO CsA (CYPD) PJ-34 (PARP1) VX-765 (CASP1) GKT137831 
GSK843 (RIPK3) (NOX1 and 
GSK872 (RIPK3) NOX4) 
SA (HMLKL) 
GW906742X (MLKL) 
Physiology 
Controlling cell numbers during Embryogenesis? Glu toxicity IR-injury DNA damage __ Inflammation Inflammation 
embryogenesis and homeostasis Homeostasis? IR-injury Thrombosis Neurodegen. Pathogen defence Extracellular 
Immune regulation Inflammation Neurodeg. Transplant. trap formation 
Pathogen defence IR-injury Transplant. Pathogen 
Thrombosis defence 
Neurodegen. 


3-AB, 3-aminobenzamide; APAF1, apoptotic protease-activating factor 1; BAK, apoptosis regulator BAK; BAX, apoptosis regulator BAX; BID, BH3-interacting domain death agonist; CASP, caspase; 
Cpd27, compound 27 or fluro[2,3-d]pyrimidine 27; CsA: cyclosporine A; DPI, diphenylene iodonium; FADD, Fas-associated death domain; Fer-1, ferrostatin-1; Glu, glutamate; GSH, reduced 
glutathione; GPX4, glutathione peroxidase 4; MLKL, mixed lineage kinase like; NSA, necrosulfonamide MLKL inhibitor; NEC1, necrostatin-1; NEC1s, necrostatin-1s; Neurodegen., neurodegeneration; 
NLR, NOD-like receptor; NOX, NADPH oxidase; RIPK, receptor-interacting serine/threonine-protein kinase; qVD-Oph, quinolyl-Val-Asp-Oph; SfA, sanglifehrin a; Transplant., transplantation; TRPM7, 
transient receptor potential cation channel subfamily M member 7; 2VAD-fmk, carbobenzoxy-valyl-alanyl-aspartyl-[O-methy|]-fluoromethylketone. *Kinase active form, ‘kinase inactive form. 
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Moreover, depletion of mitochondria by parkin RBR E3 ubiquitin pro- 
tein ligase (PARK2) and/or carbonyl cyanide m-chlorophenylhydrazone 
(CCCP)-induced mitophagy activation did not prevent TNF-induced 
necroptosis, arguing against mitochondrial reactive oxygen species 
(ROS) asa final executioner mechanism™. Evidence suggests that RHIM- 
dependent oligomerization and intramolecular autophosphorylation of 
RIPK3 results in the recruitment and phosphorylation of MLKL**”, 
which leads to a conformational change in the pseudokinase domain lead- 
ing to the exposure of the 4-helical bundle domain”. Two non-exclusive 
models are proposed for the executioner mechanism of MLKL: one as a 
platform at the plasma membrane for the recruitment of Ca” or Na* ion 
channels””*, and one as a direct pore-forming complex that is recruited 
through binding of the amino-terminus of the 4-helical bundle domain of 
MLKL to negatively charged phosphatidylinositolphosphates””’. 


Mechanisms regulating TNFR1-induced necroptosis 
In most cells, TNFR1 stimulation is not cytotoxic and induces direct 
pro-inflammatory signalling through the formation of a membrane- 
associated protein complex (complex I) (Fig. 1a)*. In sensitized cells, 
TNEFRI induces apoptosis through cytosolic complexes Ila and IIb, and, 
in particular conditions or cells, necroptosis by the necrosome can be 
executed. TNF-induced signalling towards necroptosis is prevented 
by several brakes on RIPK1, as the presence or absence of RIPK1 and 
its post-translational modifications such as ubiquitylation and phos- 
phorylation are imperative for the biological outcome. RIPK1 comes 
in several flavours in the four different complexes (I, Ha, IIb and Ic/ 
necrosome), and these are induced in a dynamic way after TNF bind- 
ing to TNFR1. Ligation of TNFR1 induces the formation of complex I, 
comprising TNFR1-associated death domain protein (TRADD), 
RIPK1 and the E3 ubiquitin ligases TNF-receptor-associated factor 2 
(TRAE2), the cellular inhibitors of apoptosis (cIAP1 or cIAP2) and 
the linear ubiquitin chain assembly complex (LUBAC, consisting of 
haem-oxidized IRP2 ubiquitin ligase-1, HOIL-1L; HOIL-1-interacting 
protein, HOIP; and SHANK-associated RH domain-interacting pro- 
tein, SHARPIN). TRADD is required for the recruitment of TRAF2, 
cIAP1/2 and LUBAC, and for the ubiquitylation of RIPK1 with K63 and 
linear chains within complex I. This complex favours proinflammatory 
signalling and prevents cell death through the recruitment of the TGF- 
activated kinase 1 (TAK1)-TAK1-binding protein (TAB) complex and 
of the IxB kinase (IKK) complex consisting of IKK1, IKK2 and NF-KB 
essential modulator (NEMO), leading to NF-«B and mitogen-activated 
protein kinase (MAPK) activation (reviewed in ref. 32). 
Destabilization of complex I leads to the formation of a second cyto- 
solic complex IIa, consisting of TRADD, FADD and caspase-8, which 
signals towards apoptosis”. In conditions such as TNF stimulation 
in the presence of IAP inhibitors (Smac mimetics) or knockdown of 
IAPs*®, TAK1 inhibition or knockdown*’*, NEMO knockdown” or Pel- 
lino knockdown”, a cytosolic complex IIb forms that is composed of 
RIPK1, RIPK3, FADD and caspase-8. This composition resembles the 
cytosolic ripoptosome complex, which forms independently of TNE, 
Fas ligand (FASL) or TNF-related apoptosis-inducing ligand (TRAIL) 
following the loss or inhibition of IAPs”. This complex IIb favours 
RIPK1-kinase-activity-dependent apoptosis, however when the lev- 
els of RIPK3 and MLKL are sufficiently high and caspase-8 activity 
is reduced, blocked or absent, complex IIb may evolve to form the 
necrosome. Caspase-8 is reported to inhibit necroptosis by cleaving 
RIPK1 (ref. 41) and RIPK3 (ref. 42), as well as CYLD”, a deubiquit- 
inating enzyme that removes ubiquitin chains from RIPK1 and con- 
tributes to necroptosis in vitro and in vivo!”'®4 but the contribution of 
the cleavage of each of these proteins to necroptosis inhibition remains 
unclear. Expression levels of cellular FADD-like interleukin (IL)-1f- 
converting enzyme (FLICE)-inhibitory protein (FLIP,) are crucial in 
the control of necroptosis and apoptosis. The presence of high levels 
of FLIP, leads to heteromeric caspase-8-FLIP, in complex II, which is 
catalytically active but does not lead to full processing of p10-p20 cas- 
pase-8 (and consecutive apoptosis) and alters the substrate specificity”. 


REVIEW 


Heteromeric caspase-8—-FLIP, prevents complex Ila-dependent apo- 
ptosis. The precise mechanism of the paradoxical prosurvival role of 
caspase-8 in inhibiting necroptosis is not completely clear, but requires 
the presence of FLIP,”, catalytically active caspase-8 (ref. 46), but not its 
proteolytic processing” and suppression of RIPK1-RIPK3 activation”. 
Thus, FADD-caspase-8-FLIP, -mediated control of complex IIb is a 
second important break preventing the induction of necroptosis. The 
existence of these brakes (IAPs, TAK1, caspase-8 and FLIP,) explains 
why in most studies of necroptosis they are blocked by a combination of 
Smac mimetics or TAK] inhibitor (inhibiting brake 1) and zVAD-fmk 
(carbobenzoxy-valyl-alanyl-aspartyl-[O-methy]] -fluoromethylketone, 
inhibiting brake 2), facilitating the efficient formation of the necrosome 
as long as RIPK3 and MLKL are sufficiently expressed. Most cells are 
not sensitive to TNF-induced cell death. Adding the protein synthesis 
inhibitor cycloheximide renders cells sensitive for complex Ila-mediated 
apoptosis”, and, when caspase-8 is inhibited, to necrosome-mediated 
necroptosis’, but the precise protective proteins targeted by cyclohex- 
imide remain unclear. 


Other stimuli inducing necroptosis 

There are six human death receptors (DRs) in the TNF superfamily: 
TNERI, FAS (also known as CD95 or APO-1), DR3 (also known as 
TRAMP or APO-3), TRAILR1 (also known as DR4), TRAILR2 (also 
known as DR5, TRICK or KILLER), and DR6 (ref. 48). In contrast to 
TNERI signalling, in which the prosurvival signalling complex forms 
first and the death-inducing complexes form subsequently in sensitized 
cells (see earlier), binding of FASL to FAS, or of TRAIL to TRAILRI or 
TRAILR2 induces the assembly ofa membrane-associated death-induc- 
ing signalling complex (DISC) through the adaptor protein FADD, lead- 
ing to recruitment and activation of caspase-8 and consecutive apoptosis 
(Fig. 1b). Under particular conditions such as the absence of cIAPs, which 
favours the recruitment of RIPK] to Fas” and the formation ofa cytosolic 
ripoptosome complex ITb”, these ligands also mediate necroptosis when 
caspase-8 is blocked. 

Cellular stress, damage and infection are sensed by several receptors 
such as Toll-like receptors (TLRs), nucleotide-binding and oligomeri- 
zation domain (NOD)-like receptors (NLRs), retinoic acid-inducible 
gene I (RIG-I)-like receptors (RGRs), ripoptosome and protein kinase R 
(PKR) complexes. Some of these receptors have also been demonstrated 
to induce necroptosis. Activated TLR3 forms an endosome platform 
recruiting a cytosolic adaptor, Toll/IL-1 receptor (TIR) domain-con- 
taining adaptor protein inducing interferon (IFN)-B (TRIB), which is 
involved in NF-«B activation and induction of type I IFNs”’. Besides 
signalling through the myeloid differentiation primary response 88 
(MyD88) adaptor, TLR4 also signals through TRIF”. TRIF contains a 
RHIM-domain, allowing interaction with RIPK1 and RIPK3. Lipopoly- 
saccharide (LPS)-TLR4 or polyinosine-polycytidylic acid (poly(I:C))- 
TLR3 stimulation in the presence of zVAD-fmk induce TRIF-mediated 
necroptosis that depends on RIPK3 and MLKL, but may also proceed in 
the absence of RIPK1 (refs 51, 52). However, inhibition of RIPK1 kinase 
activity by necrostatin-1 (Nec1)*’” or in knock-in macrophages express- 
ing kinase-inactive RIPK1?"**’ (ref. 53) prevented LPS-poly(I:C)-zVAD- 
fmk-induced necroptosis. These results suggest that RIPK1 is probably 
recruited to the TRIF-signalling complex and when its kinase activity is 
inhibited it acts as a blocker of TLR3- or TLR4-induced RIPK3 activation, 
although TRIF may also directly recruit and activate RIPK3 in the absence 
of RIPK] (Fig, 1c)”’”*. Another RHIM-domain-containing protein is the 
cytosolic DNA-dependent activator of IFN regulatory factors (DAI; also 
known as Z-DNA binding protein 1, ZBP1), which in response to viral 
double-stranded DNA induces NF-«B activation and type I IFNs, but also 
RIPK3-mediated necroptosis™ (Fig. 1d). 

In the absence of caspase-8 and FADD, both type I and type II IFNs 
were recently shown to induce RIPK1-RIPK3 necrosome formation in 
mouse embryonic fibroblasts (MEFs) that depends on the viral RNA 
responsive protein kinase R, which interacts with RIPK1 (ref. 55). How- 
ever, independent experiments in PKR-deficient macrophages do not 
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support an essential role for this kinase in IFN-a receptor type I-mediated 
necroptosis. Interestingly, depending on the cell type, autocrine loops 
may be implicated in the regulation of necroptosis. In macrophages, 
LPS-TLR4-, TNF-TNFR1- and poly(I:C)-TLR3-mediated necroptosis 
requires IFN-a receptor type I signalling, suggesting an autocrine loop 
of type I IFNs°° (Fig. le). Moreover, stimulation of Pam3CysK-TLR2, 
Flagellin-TLR5, CpG-TLR9 (ref. 52) or etoposide administration” leads 
to an auto- or paracrine TNF-production loop, which in conditions of IAP 
inhibition or caspase-8 depletion sensitizes cells to TNF-induced death 
by RIPK1-dependent necroptosis. Therefore, regulation of necroptosis 
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Figure 1 | Pathways leading to necroptosis. a, The binding of TNF to its 
receptor (TNEFR1) induces the formation of the receptor-bound complex I 
(made up of TRADD, RIPK1, TRAF2, IAP1, IAP2 and LUBAC) that activates 
NF-«B and AP-1 and mitogen-activated protein kinase (MAPK) signalling by 
the ubiquitin-chain-dependent recruitment of the IKK (made up of NEMO, 
IKK1 and IKK2) and TAB/TAK-1 complexes. In CHX-treated cells, the 
cytoplasmic complex IIa (made up of TRADD, FADD and caspase-8) forms, 
which leads to caspase-8-mediated apoptosis independent of RIPK1. In cells 
treated with IAP antagonists, TAK1 inhibition or knockdown, or NEMO 
knockdown the cytoplasmic complex IIb (ripoptosome-like) forms (made up 
of RIPK1, RIPK3, FADD and caspase-8), resulting in caspase-8-dependent 
apoptosis, which depends on RIPK1 kinase activity and an RIPK3 platform 
(apoptosis*). FLIP, keeps caspase-8 in a heteromeric complex that controls 
RIPK1 and RIPK3 levels by proteolysis. When caspase-8 is inhibited complex IIc 
or the necrosome (made up of RIPK1, RIPK3 and MLKL) is formed, inducing 
RIPK] kinase activity and RIPK3-kinase-activity-dependent necroptosis. b, 
Stimulation of Fas or TRAILR induces the formation of the receptor-bound 
death-inducing signalling complex (DISC) that triggers caspase-8-mediated 
apoptosis independent of RIPK1. In the presence of IAP antagonists, Fas and 
TRAILR stimulation results in the recruitment of RIPK1 generating a complex 
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could be a crucial factor in controlling responses to infection, DNA dam- 
age and inflammation. 


RIPK1 determines cell survival or death 

Genetic studies have demonstrated the pivotal role of RIPK1 in cell sur- 
vival and death. RIPK1 deficiency causes perinatal lethality’, which 
is fully prevented by the combined absence of caspase-8 and RIPK3 
(refs 58-60). Caspase-8 deficiency did not prolong survival of Ripk1~ 
mouse neonates, although it prevented apoptosis in many tissues includ- 
ing the intestine, thymus, liver and lung, whereas RIPK3 or MLKL 
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similar to complex I in (a), which can progress either in the formation of the 
cytosolic ripoptosome that induces caspase-8-mediated apoptosis that depends 
on RIPK1I kinase activity or, when caspase-8 activity is inhibited, RIPK1- 
kinase-activity-dependent necroptosis. c, TLR4 or TLR3 stimulation triggers 
formation of the necrosome through the RHIM-containing adapter TRIE, 
resulting in RIPK3-dependent necroptosis in which the role of RIPK1 depends 
on the cellular context. d, DNA-dependent activator of IFN regulatory factors 
(DAI) recognizes viral double-stranded DNA and through its RHIM domain 
recruits RIPK3 and induces the formation of the necrosome without RIPK1 and 
triggers RIPK1-independent RIPK3-kinase-activity-dependent necroptosis. e, 
In bone-marrow-derived macrophages type I interferons (IFNa and B) induce 
necroptosis through their cognate receptor, IFNARI, leading to activation of 
JAK1. These cause the formation of the ISGF3 complex (STAT1-STAT2-IRF9), 
which in a transcription-dependent way causes induction and activation of 

the necrosome complex. TNF (IRF1) and lipopolysaccharide (LPS; IRF3/7) 
signalling can induce an autocrine loop for IFNf. In macrophages, IFNARI is 
required for TNF- and LPS-induced necroptosis. In the presence of zVAD-fmk, 
sustained activation of the necrosome eventually results in necroptosis. The 
‘glowing’ symbols represent enzymatic activity. Homotypic interaction motif 
(RHIM domain) is indicated by the blue line. 
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deficiency ameliorated systemic inflammation, prevented epidermal 
hyperplasia and marginally prolonged the survival of Ripk1’ pups, 
revealing distinct functions for apoptosis and necroptosis in the mul- 
tiorgan pathology and perinatal death of RipkI” mice”. In addition, 
intestinal epithelial cell (IEC)-specific knockout of Ripk1 caused severe 
lethal intestinal pathology due to FADD-caspase-8-mediated apoptosis 
of IECs induced primarily, but not exclusively, by TNF*™. Importantly, 
raising IEC-specific Ripk1-knockout mice or Ripk1™ Ripk3” mice under 
germ-free conditions did not prevent apoptosis of IECs and intestinal 
pathology, demonstrating that the microbiota is not essential to trigger 
the death of RIPK1-deficient epithelial cells, although antibiotic stud- 
ies suggested that bacteria may aggravate the phenotype”. Epidermal- 
keratinocyte-specific Ripk1 knockout, however, triggered severe skin 
inflammation by sensitizing keratinocytes to RIPK3-MLKL-dependent 
necroptosis, revealing a novel role for RIPK1 as an inhibitor of necroptosis 
in keratinocytes”. Furthermore, RIPK1 deficiency in haematopoietic cells 
caused bone marrow failure owing to haematopoietic cell apoptosis and 
necroptosis™”. Collectively, these studies demonstrated the essential role 
of RIPK1 in preventing apoptosis and necroptosis in vivo. Knock-in mice 
expressing kinase inactive Ripk1 alleles did not show postnatal lethality 
or tissue pathology*””*""**, demonstrating that RIPK1 mediates cell 
survival by kinase-independent scaffolding functions. The pro-survival 
functions of RIPK1 that are important for preventing IEC apoptosis seem 
to be independent of NF-«B activation’. Although the exact mecha- 
nisms by which RIPK1 prevents cell death remain to be elucidated, RIPK1 
deficiency but not lack of its kinase activity resulted in degradation of 
cIAP1, FLIP, and TRAF2 in vivo and in response to TNF stimulation 
in vitro, suggesting that kinase-independent RIPK1 scaffolding properties 
are important to sustain pro-survival signalling platforms’. 

RIPK1 kinase activity is not required for activation of NF-KB and 
MAPK signalling but mediates cell death by either apoptosis or necrop- 
tosis. On the one hand, lack of RIPK1 kinase activity partly inhibited 
but did not prevent RIPK3-mediated necroptosis of FADD-deficient 
IECs and keratinocytes, demonstrating the existence of RIPK1-kinase- 
activity-dependent and -independent pathways that induce necroptosis 
in vivo”. On the other hand, lack of RIPK1 kinase activity as well as RIPK3 
deficiency protected mice from TNF-induced systemic inflammatory 
response syndrome (SIRS)***"”, demonstrating that kinase-dependent 
pro-death functions of RIPK1 rather than direct proinflammatory gene 
induction are crucial for TNF-induced SIRS, as previously proposed”. 


Mechanisms determining apoptosis or necroptosis 
Despite the advances in unravelling the pathways that regulate necropto- 
sis, the precise mechanisms determining the decision whether a cell will 
die by apoptosis or necroptosis remain poorly understood. On the posi- 
tive regulatory side, several studies have suggested that expression levels 
of RIPK3 and MLKL correlate with sensitivity to necroptosis'*'77»*”, 
However, a potential drawback of these studies is that RIPK3 and MLKL 
expression was compared between healthy and inflamed tissues; it is, 
therefore, difficult to conclude that RIPK3 and/or MLKL overexpression 
has primary causal functions and is nota secondary consequence of ongo- 
ing inflammation. Interestingly, some studies suggest that whether RIPK3 
is catalytically active or not may determine necroptosis or apoptosis. As a 
catalytically inactive platform RIPK3 favours RIPK1-dependent apopto- 
sis, whereas catalytically active RIPK3 induces necroptosis”*. This bifurca- 
tion was illustrated in vivo by the finding that, although RIPK3-knockout 
mice have no spontaneous phenotype, RIPK3”’*’’-kinase-inactive knock- 
in mice die during embryogenesis due to RIPK1-FADD-caspase-8 
mediated apoptosis”, suggesting that kinase-active RIPK3 is required 
to suppress apoptosis. However, a recent report showed that RIPK3°“ 
kinase inactive knock-in mice are viable and that RIPK3 inhibitors block 
TNF-induced necroptosis, but at higher concentrations induce RIPK3- 
scaffold-dependent apoptosis, suggesting that RIPK3 conformational 
changes and not inhibition of its kinase activity trigger apoptosis”. 

On the negative regulatory side, caspase-8 is the most crucial factor for 
preventing necroptosis. Indeed, in most in vivo experimental systems thus 
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far, sensitization to necroptosis was achieved by a genetic defect compro- 
mising FADD-caspase-8 signalling, and thus inhibiting apoptosis'*™*. 
cIAP loss sensitizes cells to apoptosis” * and — when caspase-8 is inhib- 
ited — to necroptosis"”, suggesting that cIAP inhibition confers suscepti- 
bility to cell death but that the decision whether a cell dies by apoptosis or 
necroptosis depends on caspase-8 activity. As already discussed, RIPK1 
deficiency sensitizes cells to both apoptosis and necroptosis® , although 
the factors determining whether a cell that lacks RIPK1 will die by apo- 
ptosis or necroptosis remain elusive. Therefore, more in vivo studies are 
required to understand the mechanisms that determine the susceptibility 
of different cell types to necroptosis or apoptosis. 


Regulated cell death in inflammation 

Although initially cell death was considered to be the result of inflam- 
mation, more recently the concept that cell death may precede, trigger 
or amplify the inflammatory response has gained increasing attention’. 
Establishing cell death as an initiator of inflammation in vivo is challeng- 
ing, because it is very difficult to temporally resolve the two responses 
to demonstrate causality. In the next section, we discuss the evidence 
implicating necroptosis as an active mediator of inflammation, focusing 
particularly on the implications of these findings for the pathogenesis of 
inflammatory diseases — other aspects related to the role of regulated cell 
death in pathogen defence have been covered in other recent reviews”””*. 


Necroptosis as a trigger of inflammation 
A major obstacle in studying the role of necroptosis in vivo has been 
the lack of a definite molecular marker for the in situ identification of 
necroptotic cells. Antibodies that detect human phosphorylated MLKL, 
which were recently reported to detect necrotic hepatocytes in the liver of 
people with drug-induced liver injury’, promise to provide such a tool, 
but additional validation will be required to demonstrate their applicabil- 
ity for necroptosis detection in other human tissues. The development 
of mouse-specific phospho-MLKL antibodies will be invaluable for the 
preclinical validation of phosphorylated MLKL as a marker of necroptotic 
cells in mouse models of human disease. So far, in most published studies 
necroptotic cells are described by necrotic morphology and concomitant 
absence of caspase-3 activation, which cannot distinguish between dif- 
ferent types of necrotic cell death (Box 1). In the absence of a specific 
molecular marker, the only definite criterion for necroptosis has been 
dependence on RIPK3 and MLKL. Although NECI has been used exten- 
sively as an inhibitor of necroptosis, in light of the most recent findings 
that show the existence of RIPK1-kinase-activity-dependent and -inde- 
pendent necroptosis in vivo", as well as the complex functions of RIPK1 
kinase activity in regulating both necroptosis and apoptosis” and the 
inhibitory effects of NEC1 on indoleamine 2,3-oxygenase”’, studies based 
on NEC] cannot be considered as definite evidence for necroptosis and 
need to be reproduced in a RIPK3- and/or MLKL-deficient background. 
Therefore, in this Review, we will only discuss in vivo studies that support 
a proinflammatory role for necroptosis by demonstrating that cell death 
and inflammation are prevented by RIPK3 and/or MLKL deficiency. 
Invivo evidence to supporta proinflammatory function for necroptosis 
was initially provided by studies in mice that lack FADD in IECs, which 
had increased numbers of necrotic IECs and developed spontaneous colitis 
and ileitis with loss of Paneth cells'’. RIPK3 deficiency prevented epithelial 
cell death and inflammation in both the colon and the small intestine of 
these mice, providing in vivo experimental evidence that RIPK3-mediated 
necroptosis of epithelial cells causes intestinal inflammation’’. Germ-free 
conditions and MyD88 or TNF deficiency prevented colitis, but not Paneth- 
cell loss and ileitis in mice lacking FADD in IECs, suggesting that differ- 
ent mechanisms trigger RIPK3-mediated epithelial cell necroptosis and 
inflammation in the colon and the small intestine’. IEC-specific knockout 
of caspase-8 caused Paneth cell loss and ileitis”, which was subsequently 
shown to depend on RIPK3, but not on TNFR1 as initially suggested®. 
Interestingly, in contrast to IEC-specific FADD knockouts, IEC-specific 
caspase-8 knockout mice did not develop colitis, suggesting that FADD and 
caspase-8 have different functions in the colonic epithelium. 
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Mice with keratinocyte-specific FADD deficiency developed RIPK3- 
dependent keratinocyte necroptosis and skin inflammation, suggesting 
that epithelial cell necroptosis also triggers inflammation in the skin™. 
Keratinocyte-specific caspase-8 knockout mice also developed skin 
inflammation, which was suggested to be caused by the loss of caspase- 
8-dependent suppression of RIG-I-IRF3 signalling*’*”. However, mito- 
chondrial antiviral-signalling protein (MAVS) deficiency did not prevent 
skin inflammation in mice with inducible ablation of caspase-8, arguing 
against an important role for the RIG-I-MAVS-IRF3 axis”. By contrast, 
RIPK3 deficiency prevented skin inflammation in these mice®™, further 
supporting the idea that RIPK3-dependent keratinocyte necroptosis trig- 
gers inflammation in mice lacking caspase-8 or FADD in the epidermis. 
In addition, epidermis-specific RIPK1 knockout caused keratinocyte 
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Figure 2 | Regulated cell death triggers inflammation. a, Epithelial cell 
death in barrier tissues can cause barrier disruption, allowing commensal 

or environmental microbes to invade the tissue. Recognition of microbial 
pathogen-associated molecular patterns (PAMPs) by pattern recognition 
receptors on myeloid or stromal cells induces the expression of cytokines and 
chemokines that attract and activate immune cells resulting in inflammation. 
Cytokines expressed by immune cells (for example, TNF) could trigger the 
death of additional epithelial cells, further compromising the barrier in a vicious 
circle, resulting in chronic non-resolving inflammation. In this setting, DAMP 
release may have a limited contribution as the presence of microbial PAMPs 
could fully drive the inflammatory response. It is unclear whether apoptosis and 
necroptosis have a differential capacity to induce barrier disruption. b, In sterile 
tissues, apoptotic cell death is considered a weak inducer of inflammation, as the 
orderly disassembly of the dying cell allows no, or limited, release of DAMPs. 

By contrast, the massive release of DAMPs by disintegrating necroptotic cells is 
believed to be a strong trigger inducing inflammation (see Box 2). 
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necroptosis and skin inflammation that was prevented by RIPK3 or 
MLKL deficiency”, whereas Ripk1~” newborn pups developed RIPK3- 
and MLKL-dependent skin hyperplasia”, providing additional evidence 
that necroptosis triggers inflammation. TNFR1-deficiency ameliorated, 
but could not fully prevent inflammation in these mice, suggesting that 
although TNF is a major driver of cell death and inflammation, TNF- 
independent pathways also contribute in these models. 

In addition to intestinal and skin epithelia, necroptosis of several other 
cell types was shown to trigger inflammation. RIPK3-dependent necro- 
sis of retinal pigment epithelial cells induced inflammation in a mouse 
model of double-stranded RNA-induced retinal degeneration™. Moreo- 
ver, RIPK3-deficient mice were protected from TNF or TNF-zVAD-fmk 
induced SIRS, demonstrating that necroptosis has an important role in 
TNF-induced shock”***”. Two studies showed that RIPK3-mediated 
necroptosis contributes to caecal ligation and puncture-induced sep- 
sis””*, although another study failed to confirm the function of RIPK3 
or MLKL in this model* of polymicrobial sepsis. In addition, RIPK3- and 
MLKL-dependent necroptosis was shown to exacerbate tissue injury and 
inflammation in a mouse model of acute pancreatitis'”**, whereas RIPK3 
deficiency was protective in a mouse model of Gaucher’s disease”. Fur- 
thermore, RIPK3 deficiency ameliorated ischaemia-reperfusion-induced 
injury and inflammation and prolonged kidney and heart allograft sur- 
vival**°, suggesting that necroptosis of graft cells could bea crucial factor 
in triggering inflammation contributing to transplant rejection. 


How does regulated cell death cause inflammation? 
Cell death could induce inflammation indirectly, by disrupting epithelial 
barriers triggering microbe-driven immune responses (Fig. 2a). This is 
particularly relevant in the intestinal epithelium, which provides a barrier 
between luminal bacteria and the mucosal immune system. IEC death 
usually triggers intestinal inflammation in different mouse models and 
here it is unclear whether apoptosis and necroptosis differ in their capacity 
to induce inflammation. Because barrier disruption not only depends on 
cell death but also on the capacity of the remaining cells to proliferate and 
keep the barrier sealed, it is important to study whether necroptosis and 
apoptosis differentially regulate key barrier-related properties of neigh- 
bouring epithelial cells such as proliferation, migration and adhesion. 
Ina sterile setting, dying cells could directly trigger inflammation by 
releasing factors collectively described as damage-associated molecular 
patterns (DAMPs) (Box 2). Apoptosis is generally considered to be non- 
immunogenic because the orderly disassembly of apoptotic cells allows 
no or limited DAMP release (Fig. 2b). However, there is a general consen- 
sus that necroptosis directly triggers inflammation by a massive release 
of DAMPs from the disintegrating cell’’ (Fig. 2b). Although this notion 
is clearly supported by evidence that DAMPs are released by necrotic 
cells and that specific DAMPs are important mediators of inflammation 
in vivo”, at present there are no in vivo studies that provide direct experi- 
mental proof that necroptosis-induced inflammation depends on spe- 
cific DAMPs. Such experimental proof might be difficult to obtain using 
genetic models owing to functional redundancy between DAMPs, and 
the important intracellular functions of several DAMPs. Indeed, surpris- 
ingly, myeloid-, hepatocyte- or pancreas-specific knockout of HMGB1, 
a prototypic DAMP, did not ameliorate but instead exacerbated LPS- or 
injury-induced damage and inflammation owing to an important role 
for HMGB1 in maintaining genome homeostasis and cell survival and 
preventing histone release” ™. Antibody neutralization may therefore be 
more suitable for addressing the functional role of specific DAMPs such 
as HMGB1, although this approach can not address the cellular origin of 
DAMPs to formally demonstrate that DAMP release by specific necrotic 
cells causes inflammation. IL-1 family cytokines (IL-1a, IL-1, IL-18, 
IL-33, IL-36 and IL-37) constitute important DAMPs and their role as 
mediators of necroptosis-induced inflammation needs to be addressed 
in vivo in relevant genetic models. In the absence of conclusive functional 
in vivo studies, the role of specific DAMPs as mediators of necroptosis- 
induced inflammation is currently largely based on correlative evidence 
and nice schemes in reviews, and awaits in vivo experimental validation. 
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Damage-associated molecular patterns (DAMPs) collectively describes 
the molecules and cellular components that are released or exposed 
by dying, injured or stressed cells and that act as danger signals to alert 
the immune system. DAMPs include cytokines and alarmins that are 
released mainly by dying cells such as the interleukin-1 family cytokines 
IL-la and IL-33, as well as the S100 proteins S100A8, S100A9 and 
$100A12. In addition, several cellular components that are normally 
found inside the cell performing important and predominantly non- 
immunological functions, are released only by dying or damaged cells 
to actas DAMPs. These include nucleic acids and ribonucleoproteins, 
histones and HMGB family members, heat-shock proteins, 


Damage-associated molecular patterns 


mitochondrial N-formyl peptides and DNA, or even intact mitochondria, 
F-actin, calreticulin, but also molecules such as monosodium urate 

and ATP (reviewed in ref. 107). DAMPs are usually detected by pattern 
recognition receptors that activate immune responses by inducing 

the expression of cytokines and chemokines. DAMPs are primarily 
released by cells undergoing necrosis, although apoptotic cells were 
recently shown to expose or release DAMPs!2. DAMPs may undergo 
modifications that inhibit their immunogenicity, for example caspase- 
dependent proteolysis of IL-33 or oxidation of HMGB1 (refs 123-125). 
Therefore, the type of cell death may determine the nature, quantity and 
immunogenicity of the DAMPs released. 


Necroptosis-dependent and -independent functions of RIPK3 
Genetic rescue experiments using RIPK3 or MLKL knockouts are rou- 
tinely used to demonstrate that necroptosis triggers inflammation. How- 
ever, it remains unclear whether in some cases RIPK3 might be capable of 
triggering inflammation by exerting necroptosis-independent functions. 
RIPK3 has been implicated in inflammasome activation and IL-1 release 
in Smac-mimetic-treated macrophages or in caspase-8-deficient den- 
dritic cells”*”*, apparently in a cell-death-independent manner, although 
multiple independent studies failed to show a role for RIPK3 in canonical 
or non-canonical inflammasome activation” ”°. RNA viruses, however, 
were recently reported to activate the NLRP3 inflammasome in a RIPK1- 
and RIPK3-dependent but MLKL-independent manner, suggesting that 
in response to viral RNA detection RIPK1 and RIPK3 induce inflam- 
masome activation in a necroptosis-independent manner that seems 
to involve mitochondrial fission and ROS production™. These studies 
suggest that regulation of the inflammasome by RIPK3 may contrib- 
ute to inflammation, although available in vivo data do not support an 
important role for the inflammasome in RIPK3-dependent inflammation, 
because, for example, double IL-1a and IL-1 deficiency did not prevent 
inflammation in mice with epidermal keratinocyte-specific knockout 
of caspase-8 (ref. 81). Furthermore, a recent study reported that RIPK3 
regulates LPS-induced NF-«B and caspase-1 activation and inflamma- 
tory cytokine production in dendritic cells but not in macrophages, and 
controls dextran-sodium-sulphate-induced colon inflammation”. 
Several studies have shown that caspase-8 activates the inflammasome 
and also directly cleaves pro-IL-1 to secrete IL-1B"°"””'""™, suggest- 
ing that it could induce inflammation independently of its pro-apoptotic 
functions. However, most were performed on macrophages lacking 
both caspase-8 and RIPK3, as caspase-8-knockout macrophages do not 
survive, raising the question of whether the observed effects might, to 
some extent, depend on the role of caspase-8 in inhibiting necroptosis. 
An alternative explanation for these findings could be that inflamma- 
some activation is coupled to cell death in cells that are primed to express 
inflammasome components and IL-1-family cytokine precursors. Cas- 
pase-8 could directly cleave IL-1 precursor proteins and perhaps also cas- 
pase-1 during apoptosis, whereas RIPK3-MLKL-dependent necroptosis 
could trigger inflammasome activation by inducing changes in the redox 
state, intracellular ion concentrations and the metabolic status of the cell, 
all of which are well-known inflammasome inducers’. This hypothesis 
is intriguing in light of recent findings showing that IL-1 is released 
primarily from dying macrophages in response to inflammasome acti- 
vation’. Considering that most stimuli known to act as inflammasome 
activators in primed macrophages inflict cellular damage, activation of the 
inflammasome could represent a unifying mechanism endowing primed 
cells with the capacity to induce inflammation when they are exposed to 
cell-death-inducing stress. Coupling regulated cell death with inflamma- 
some activation in primed cells could provide a mechanism to ensure that 
the death of cells exposed to stimuli that activate NF-«B, such as microbial 


products or inflammatory cytokines, will trigger an immune response to 
mediate host defence and promote tissue repair. The potent capacity of 
most stimuli that trigger regulated cell death (for example, TNE IFNs and 
LPS) to also induce proinflammatory gene expression ensures that in a 
population of cells mixed responses occur between cells that survive or 
die, and that dying cells have also been primed to produce cytokines and 
chemokines’. Therefore, additional studies in relevant in vivo models 
will be needed to assess the contribution of cell-death-dependent and 
-independent functions of RIPK3 and caspase-8 in inflammation. 


Is necroptosis more inflammatory than apoptosis? 
Apoptosis is generally considered to be non-immunogenic, based on the 
concept that developmentally programmed cell death, such as that dur- 
ing thymocyte selection in the thymus, should not trigger inflammation. 
However, apoptotic cell death that is not developmentally programmed 
indicates tissue injury and should be detected by the immune system 
to ensure efficient tissue regeneration and host defence. Indeed, several 
recent studies supported a proinflammatory role of apoptosis. Apoptosis 
of cancer cells in response to some chemotherapeutic agents was shown to 
be highly immunogenic, contributing to therapy responses (reviewed in 
ref. 107). Mouse-model studies suggested that apoptosis of epithelial cells 
lacking NEMO or TAK triggers chronic inflammation’, although 
the potential role of necroptosis in these models has not been addressed. 
Compelling evidence that epithelial cell apoptosis induces inflamma- 
tion comes from studies showing that inducible FLIP, knockout caused 
keratinocyte apoptosis and inflammation**”, which could not be pre- 
vented by RIPK3 deficiency”. In addition, skin inflammation in chronic 
proliferative dermatitis mice was recently shown to depend primarily 
on TNFR1-TRADD-RIPK1-dependent death of SHARPIN-deficient 
keratinocytes mediated primarily by FADD-caspase-8-dependent apop- 
tosis, whereas RIPK3-dependent necroptosis contributes to disease sever- 
ity, but is not essential for the induction of inflammation™’*"™. 
Necroptosis is believed to be a more potent inducer of inflammation 
than apoptosis but this concept has not been rigorously tested in relevant 
in vivo experimental models. Clearance of dying cells is crucial in limit- 
ing inflammation’. Although both apoptotic and necroptotic cells are 
recognised and cleared by phagocytes, the mechanism may be different 
owing to profound differences in the morphology, blebbing and apoptotic 
body formation in apoptosis compared with oncosis and rapid plasma 
membrane permeabilization in the case of necroptosis. The more acute 
release of DAMPs could support increased proinflammatory properties 
of necroptosis; however, as already discussed, the role of specific DAMPs 
in necroptosis-induced inflammation in vivo has not been function- 
ally validated. Particularly considering that co-incubating apoptotic or 
necrotic cells with macrophages did not result in cytokine induction”, 
the in vivo context may be very relevant for this type of experiment. To 
allow a direct comparison of apoptosis and necroptosis, ideally they would 
need to be induced in a similar number of cells in the same tissue. Recent 
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Figure 3 | Regulated cell death fuels the vicious circle in chronic 
inflammation. Regulated cell death could play a part in the initiation and 
amplification or chronicity of inflammation. This form of cell death can 

induce immune activation and cytokine expression indirectly through barrier 
disruption, allowing microbial entry, or directly by the release of DAMPs. 
Activated immune cells produce cytokines that induce inflammation directly 
by activating the expression of proinflammatory genes but also trigger regulated 
cell death, closing a circle that amplifies the inflammatory response. Infection, 
injury and stress can initiate the response by inducing regulated cell death 
directly and/or by triggering immune-cell activation and cytokine production. 


results showing that necroptosis, but not apoptosis, of RIPK1-deficient 
keratinocytes triggers skin inflammation provide initial in vivo experi- 
mental evidence that, at least in this model, keratinocyte necroptosis is 
a more potent trigger of skin inflammation compared with apoptosis”, 
although, as already discussed, keratinocyte apoptosis has been shown 
to trigger inflammation in other models. In conclusion, although there 
are indications that necroptosis is more inflammatory than apoptosis, 
newand more specific in vivo models are needed to directly compare the 
capacity of these two pathways of regulated cell death to trigger inflam- 
mation and address the underlying mechanisms. Such models could take 
advantage of intracellular mediators that exclusively induce apoptosis (for 
example, tBid) or necroptosis (for example, constitutively active MLKL 
mutants) to specifically address the function of each cell-death pathway 
without interference from extracellular inducers. 


Necroptosis in human disease 
Studies in mice have provided strong evidence suggesting that necroptosis 
has a crucial role in disease pathogenesis. In addition to acting as an initia- 
tion signal, regulated cell death could contribute to the amplification and 
chronicity of inflammation since many cytokines produced during the 
immune response (such as, TNF family members or INFs) are potent cell- 
death inducers. A vicious circle of cell death, DAMP release, immune-cell 
activation and release of death-inducing cytokines may fuel prolonged 
non-resolving inflammatory responses and contribute to the pathogenesis 
of chronic inflammatory diseases (Fig. 3). It is intriguing to speculate that 
the pathogenic role of TNF in chronic inflammation could be, at least 
in part, due to its cell-death inducing properties. However, few studies 
addressing the role of necroptosis in human inflammatory diseases exist 
and these rely largely on correlative evidence such as the upregulation of 
RIPK3, which is only indicative and does not provide proof that necrop- 
tosis takes place and is causally associated with disease pathogenesis. 
Epithelial cell death has been associated with intestinal inflammation 
in inflammatory bowel disease (IBD) and although early reports focused 
on apoptosis it is not clear whether necroptosis also occurs’”, in particu- 
lar because most studies used TdT-mediated dUTP nick end labelling 
(TUNEL) assays that cannot distinguish between apoptotic and necrop- 
totic cells. Mouse-model studies showed that epithelial-cell necropto- 
sis induces intestinal inflammation, suggesting that necroptosis could 
also contribute to the pathogenesis of IBD in humans!”””"?, However, 
evidence that necroptosis is implicated in human IBD remains limited. 
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An initial study identified dying cells with necrotic morphology in ileal 
crypt sections from patients with Crohns disease, indicating that Paneth 
cells undergo necroptosis”. A more recent study showed that RIPK3 and 
MLKL were upregulated, whereas caspase-8 expression was reduced in 
the inflamed mucosa of children with Crohn’s disease, ulcerative colitis 
or allergic colitis and suggested that necroptosis is a crucial event that 
amplifies inflammation and contributes to these intestinal pathologies”. 
Although these studies do indicate that necroptosis might be involved in 
IBD they are based on rather weak and correlative evidence, therefore 
more research using specific and sensitive molecular markers of necrop- 
tosis will be required to establish the potential role of necroptosis in IBD. 
Mouse-model experiments identified keratinocyte necroptosis as a 
potent trigger of skin inflammation'**’, suggesting that keratinocyte 
necroptosis might also be implicated in the pathogenesis of human 
inflammatory skin diseases. A recent report suggested that keratinocyte 
necroptosis contributes to severe cutaneous adverse drug reactions in 
humans and in a mouse model of the disease''”. However, the occurrence 
and potential role of necroptosis in human inflammatory skin diseases 
such as psoriasis has not been investigated. Recent reports based on mouse 
models suggested that RIPK3-mediated necroptosis contributes to liver 
injury and inflammation induced by alcohol but also in non-alcoholic 
steatohepatitis'”'”’. Upregulation of RIPK3 was detected in hepatocytes 
in the livers of people with alcohol-induced liver disease and non-alco- 
holic steatohepatitis, suggesting that RIPK3-dependent necroptosis 
could contribute to hepatocyte death and inflammation”. Moreover, 
immunostaining with antibodies that recognize phosphorylated MLKL, 
which promise to provide a specific marker of necroptosis, suggested that 
hepatocyte necroptosis occurs in the livers of people with drug-induced 
liver injury”’. Therefore, although mouse-model experiments strongly 
suggest that necroptosis could be implicated in inflammatory diseases, 
more studies using definite molecular markers of necroptosis and func- 
tional validation using specific inhibitors will be required to establish 
the involvement of necroptosis in the pathogenesis of human diseases. 


Outlook 

Necroptosis is now established as an important pathway of regulated cell 
death, but many questions remain to be addressed. What are the mecha- 
nisms controlling the formation and activity of the necrosome? What is 
the role of specific ubiquitylation events and enzymes? Whatis the precise 
mechanism of MLKL-mediated necroptosis? How is the kinase activ- 
ity of RIPK1 and RIPK3 regulated and what are their substrates? What 
are the specific kinase-dependent and scaffolding functions of RIPK1 
controlling inflammation, cell survival, apoptosis and necroptosis? What 
is the relative contribution of cell-death-dependent and -independent 
functions of RIPK3 and RIPK1 in inflammation? How does necroptosis 
contribute to the initiation, amplification and chronicity of inflamma- 
tion? Does necroptosis display different properties in regulating inflam- 
mation compared with other types of regulated cell death? What is the 
relationship between RIPKs, necroptosis and inflammasome activation? 
These questions become particularly important when considering that 
stimuli triggering cell death suchas TNF also potently induce inflamma- 
tory gene expression. The clinical efficacy of anti-TNF therapy has estab- 
lished TNF as a key player in chronic inflammatory diseases including 
IBD, rheumatoid arthritis and psoriasis, but the TNF-dependent patho- 
genic mechanisms remain unclear. Determining the relative contribution 
of cell-death-dependent and -independent pathways in TNF-induced 
chronic inflammation may lead to new and more specific therapeutic 
targets. New preclinical mouse-model studies will be needed to answer 
these mechanistic questions. Addressing the role of necroptosis in human 
disease remains a major challenge and will require the establishment of 
specific, sensitive and reliable molecular markers of necroptosis. Current 
data raise the hope that manipulating necroptosis could provide new and 
urgently needed therapeutic opportunities in acute and chronic inflam- 
matory conditions. It remains to be seen ifthis promise will be fulfilled. = 
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Function and information 
content of DNA methylation 


Dirk Schtibeler’? 


Cytosine methylation is a DNA modification generally associated with transcriptional silencing. Factors that regulate 
methylation have been linked to human disease, yet how they contribute to malignances remains largely unknown. 
Genomic maps of DNA methylation have revealed unexpected dynamics at gene regulatory regions, including active 
demethylation by TET proteins at binding sites for transcription factors. These observations indicate that the underlying 
DNA sequence largely accounts for local patterns of methylation. As a result, this mark is highly informative when study- 
ing gene regulation in normal and diseased cells, and it can potentially function as a biomarker. Although these findings 
challenge the view that methylation is generally instructive for gene silencing, several open questions remain, including 
how methylation is targeted and recognized and in what context it affects genome readout. 


ethylation of nucleotides provides a molecular means to revers- 
M= mark genomic DNA. Bacteria can methylate adenosine or 

cytosine to identify and degrade invading DNA and to track 
mismatch repair and the progress of genome duplication before cell divi- 
sion”, In eukaryotes, DNA methylation only occurs at cytosine residues. 
Since the discovery that in vitro methylated DNA is transcriptionally 
inactive when transfected into Xenopus oocytes’ or cultured mamma- 
lian cells**, methylation has been functionally linked to gene repression. 
Importantly, methylation is not always essential to eukaryotic gene regula- 
tion as it is absent in many organisms. These include metazoans such as 
dipteran insects or Caenorhabditis elegans, illustrating that transcriptional 
changes during development do not necessarily require the organism to 
methylate DNA. Although DNA methylation is obligatory in many clades, 
its prevalence and genomic distribution varies widely, suggesting that 
there are distinct modes of targeting and function” ’. 

Methylation of DNA can change the functional state of regulatory 
regions, but it does not change the Watson-Crick base pairing of cytosine. 
It thus presents the classic ‘epigenetic’ mark and is functionally involved in 
many forms of stable epigenetic repression, such as imprinting, X chromo- 
some inactivation and silencing of repetitive DNA‘. In vertebrates, herit- 
able methylation only occurs at the CpG dinucleotide. Sequence symmetry 
of CpGs enables propagation of the methyl mark through cell division’; 
in 1975, this was proposed as a pathway for cellular memory of transcrip- 
tional states'””’. This potential for inheritance coupled with the fact that 
DNA methylation patterns change during development and disease*”” 
partially explains the interest in DNA methylation asa memory module”. 

Technological advances have resulted in genomic maps of DNA 
methylation at unprecedented resolution, revealing that regulatory 
sequences are indeed unmethylated when active'* '*. Despite these 
striking but correlative observations, our ability to correctly assign 
a function to the local presence of DNA methylation at particular 
genes has remained surprisingly limited. This Review summarizes 
recent advances in our understanding of the regulation and function 
of DNA methylation in mammals, and discusses the utility of DNA 
methylation as a cellular marker in basic biology and biomedicine. 


Conservation of methylation patterns in non-vertebrates 
Among the many clades that methylate their genome, vertebrates are 
unique in that cytosine methylation occurs throughout the entire genome 


(Fig. 1). By contrast, plants and invertebrates that have been analysed so 
far show ‘mosaic’ methylation patterns because only specific genomic ele- 
ments are targeted” (Fig. 1). More specifically, repetitive DNA and actively 
transcribed sequences are methylated”. In the case of repetitive DNA, it 
is evident that DNA methylation is used for repressing expression and 
preventing further expansion of these elements'”™. In plants, this target- 
ing occurs through short RNAs derived from repeat transcripts that guide 
de novo methylation to this class of elements”. A similar PIWI-interacting 
RNA (piRNA)-guided process seems to occur in the male mammalian 
germ line when DNA methylation is re-established’*”*. Furthermore, 
DNA methylation of pro-viruses has been shown to depend, in part, on 
the presence of repressive histone methylation at lysine 9 of histone H3 
(H3K9me)”*, illustrating distinct RNA and chromatin pathways that guide 
DNA methylation to repeats. 

The second canonical target is methylation of actively transcribed 
genes, a process that does not seem to primarily regulate these genes®. 
This is prominent in organisms with mosaic DNA methylation patterns 
and can even occur in the absence of repeat methylation, as in the case 
of the invertebrate chordate Ciona intestinalis**”’ . Despite its strong evo- 
lutionary conservation, genic methylation remains surprisingly poorly 
understood at both the molecular and functional level. Current models 
suggest that it helps to counteract the disruption of chromatin, such as 
nucleosome displacement, which is caused by elongating RNA poly- 
merase*”*”’, This speculation builds on established chromatin pathways 
such as the marking of transcribed sequences by methylation at lysine 
36 of histone H3 (H3K36me)*. Work in baker's yeast (Saccharomyces 
cerevisiae), which lacks DNA methylation, revealed that H3K36me 
recruits enzymes such as histone deacetylases, resulting in more densely 
packed chromatin. In the absence of this pathway, transcription creates a 
more open chromatin structure, leading to spurious activation of cryptic 
start sites”. Such a process might be even more relevant in the genomes of 
multicellular organisms, which harbour substantially larger genes. Indeed, 
DNA methylation has been suggested to suppress intragenic promoters 
in mammalian cells”, and remethylation of genes in cancer cells after 
treatment with an inhibitor of methylation maintenance occurs much 
faster at actively transcribed genes and seems to be required for their 
proper expression”. Notably, retroviruses preferentially integrate into 
actively transcribed genes”, and a process that would methylate histones 
and DNA and potentially silence them after genic integration thus seems 
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Figure 1 | Genomic distribution of methylated cytosine in a typical 
invertebrate and vertebrate genome. The representative genomic region 
includes an example of an active and an inactive gene with proximal (promoter) 
and distal (enhancer) regulatory regions. The height of the bar indicates 

the relative proportion of DNA methylation (5-methylcytosine, 5mC) that 

is observed in each region. CpG islands (CGIs), which often overlap with 
promoter regions, generally remain unmethylated, whereas CG-poor promoters 
are methylated when not active. 


plausible. Furthermore, genic methylation has been linked to splicing 
in mammalian cells, as subtle methylation differences were observed 
between introns and exons****. However, experimental evidence to sup- 
port this link remains scarce”’, and an explanation for how the elongating 
polymerase or splicing machinery could be affected by the comparatively 
small differences in methylation levels is lacking. 


Methylation in vertebrates 

The widespread methylation of vertebrate genomes suggests that their 
methylation might be a default state. This causes CpG depletion over 
evolutionary time through inefficient base-excision repair. Although 
spontaneous deamination of cytosine results in uracil, a base efficiently 
removed by uracil-DNA glycosylase (UDG), deamination of a methyl- 
ated cytosine results in thymine, a proper genomic base. The resulting 
mismatch frequently manifests in a C to T transition, despite the pres- 
ence of thymine-DNA glycosylase (TDG) and methyl-CpG-binding 
domain protein 4 (MBD4): two glycosylases that are thought to tar- 
get this particular mismatch within methylated CGs and lead to the 
removal of thymine”’. Notably, a C to T transition at CpG dinucleotides 
is the most frequent mutation observed in human diseases and between 
closely related mammals, arguing that genome-wide and genic DNA 
methylation comes at the price of an increased mutational load. Among 
vertebrates, CG depletion is particularly prominent in mammalian 
genomes, but less obvious in those of fish or Xenopus*’. Exceptions to 
this general loss of CGs reside in genomic elements termed CpG islands 
(CGIs)**. CGIs predominantly overlap with promoter regions and retain 
their expected CG content, as they generally remain unmethylated in 
the germ line, with notable exceptions”. Importantly, methylation of 
CGls causes robust transcriptional repression and CGI methylation is 


required for all reported examples of long-term mono-allelic silenc- 
ing, including X inactivation” and genomic imprinting”. Although 
CGI methylation can result in stable repression of the linked gene (for 
example, see ref. 42), surprisingly few CGIs change DNA-methylation 
state during normal development. Prominent examples include a 
set of germline-specific genes that require promoter methylation for 
their repression in somatic cells’. Importantly, however, inactive CGI 
promoters do not, in general, acquire DNA methylation but become 
methylated at lysine 27 of histone H3 (H3K27me3), a mark set by the 
Polycomb system**”*. Interestingly, CGIs marked by H3K27me3 are 
nevertheless more susceptible to DNA methylation during differentia- 
tion and in disease states such as cancer***’. Furthermore, CGIs har- 
bour H3K4 methylation independently of their activity, but only in the 
absence of DNA methylation”. This modification can repulse de novo 
methyltransferases in vitro and thus is thought to be causally involved 
in maintaining the hypomethylated state of CGIs”. 

Recent genome-wide mapping of DNA methylation at single-base reso- 
lution revealed that CG-poor regulatory regions generally acquire a low 
methylation state when occupied by transcription factors'*”*. Such varia- 
ble DNA methylation in mammals closely reflects changes in gene regula- 
tion and, as a result, methylome data provide a rich source of information 
about ongoing gene activity (see “Utility of DNA methylatiom). Although 
the functional relevance of reduced methylation at CG-poor regulatory 
regions is still unclear, it is tempting to imply an instructive role for DNA 
methylation in distal gene regulation. This popular model assumes a 
generally repressive effect of DNA methylation regardless of the position 
and densities of CGs within a particular regulatory region. Experimen- 
tal evidence does not, however, generally support this presumption. For 
example, several factors have been shown to bind methylated CG-poor 
sequences and, in turn, lead to their demethylation'** °51 Tn this scenario, 
changes in DNA methylation occur downstream of transcription-factor 
binding to their target sequence, arguing against a generally instructive 
role. It is likely that other transcription factors are more sensitive to DNA 
methylation, in particular those that contain a CG in their binding motif” 
(Fig. 2). Potential effects might also only apply to certain binding sites such 
as those with lower affinity, at which DNA methylation might further 
reduce the likelihood of binding. Nevertheless, there is limited evidence 
at present that DNA methylation at CG-poor regulatory regions is gener- 
ally instructive. This does not mean that methylation is irrelevant because 
tissue-specific deletions of, for example, DNA methyltransferases show 
very distinct phenotypes, including clear changes in gene expression, 
and mutations in these enzymes can contribute to disease’*. However, 
it remains to be determined whether these phenotypes result from local 
differential methylation of regulatory regions or global perturbations, 
including reactivation of repeats and/or methylated CGIs. 


Setting and removing DNA methylation 

In the textbook scenario, de novo DNA methyltransferases DNMT3A 
and DNMT3B” in combination with DNMT3L™ establish a pattern 
of methylation that is then faithfully maintained through cell division 
by the maintenance methyltransferase DNMT1 (ref. 55) and associ- 
ated proteins”’. This model of stable DNA methylation propagation 
has recently been revised in several ways. It was long known that DNA 


Methylation-sensitive 
transcription factor 


a Methylation-insensitive b a c 
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Figure 2 | Potential scenarios for the interplay between cytosine methylation 
(shown by level of 5-methylcytosine) and transcription-factor binding. a, A 
methylation-insensitive transcription factor causes reduced methylation after 


binding. b, A transcription factor binds specifically to the methylated state of 
its binding site. c, A methylation-sensitive transcription factor is blocked by 
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5-methylcytosine (5mC). d, Methyl-CpG-binding domain (MBD) proteins bind 
to the methylated state, leading to indirect repression, which probably requires 
high local density of CGs (shading). e, A methylation-insensitive transcription 
factor functions as a pioneer factor and creates a site of reduced methylation that 
allows a methylation-sensitive factor to bind. 
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methylation can be lost passively through imperfect maintenance”, but 
the recent discovery of the ten-eleven translocation (TET) family of pro- 
teins provided a convincing path for catalysed active demethylation in 
vertebrates”. The three members of this protein family have since been 
implicated in development, meiosis, maintenance of imprinting and 
stem-cell reprogramming” ™. TET proteins convert 5-methylcytosine 
(5mC) into 5-hydroxymethylcytosine (5hmC), a modified base that was 
first described more than 30 years ago™. Further iterative oxidations cata- 
lysed by TET result in 5-formylcytosine (5fC) and 5-carboxylcytosine 
(5caC)” (Fig. 3), which can be efficiently removed by TDG™” — a 
pathway referred to as ‘active’ demethylation. These modifications cre- 
ate an opportunity to identify sites of TET action because they can be 
specifically detected” ”’, but their detection and quantification remains 
challenging owing to their low genomic abundance compared with 
5mC. Current data argue that active regulatory regions and transcribed 
sequences show increased levels of hydroxymethylation, which is indica- 
tive of a higher turnover™”*”*. Loss of all three TET proteins leads to 
increased DNA methylation at enhancer regions in stem cells and subtle 
expression changes of linked genes”, suggesting that active demethylation 
could contribute to the activity state of distal regulatory regions’*. Notably, 
active DNA demethylation could occur in most cell types because TET 
proteins are promiscuously expressed at varying levels in non-dividing 
somatic cells. Proper quantification of DNA-methylation-turnover kinet- 
ics is crucial not only for interpreting genomic maps of this mark but 
also for concepts of epigenetic memory that invoke DNA methylation. 
Memory implies stability of the modified base, which could be impaired 
by pathways of active demethylation. However, a conclusive picture 
regarding how turnover is regulated is lacking. Current models of binding 
and local activity for both demethylases and methyltransferases, which 
are largely based on in vitro interaction data, suggest that histone modi- 
fications and RNA-DNA hybrids have a role in repelling and recruiting 
these enzymes'””*””*°. Recruitment probably constitutes just one level of 
regulation. For example, TET activity can be enhanced by the addition of 
vitamin C to the culture medium of stem cells”, suggesting that cofactors 
modulate enzyme activity. 


CG-specific readers 
The CG dinucleotide provides distinct signals based on its methylation 
state that can be differentially recognized by specific protein domains. 
The methylated form of CG is recognized by functional MBD-con- 
taining proteins in vitro and in vivo” ™, which are thought to indirectly 
contribute to methylation-mediated repression® (Figs 2d, 3b). This con- 
cept of indirect repression might explain why a high local concentration 
of methylated CpGs, such as at methylated CGIs, do confer efficient 
repression. Importantly, individual deletions of genes encoding MBD 
proteins do not result in reactivation of methylated CGIs and thus MBD 
proteins are thought to function in a redundant fashion. This model 
awaits testing through comprehensive deletions of the MBD protein 
family. CKXC domains, however, can specifically recognize unmeth- 
ylated CGs and can target proteins, such as CXXC finger protein 1 
(CFP1)* or the histone demethylases KDM2A and KDM2B to unmeth- 
ylated CGIs*’** (Fig. 3b). It has been suggested that this recruitment 
might help to maintain an unmethylated state similar to that seen with 
classic transcription factors, but evidence for this model is still miss- 
ing*’. Importantly, the presence ofa functional CXXC domain does not 
necessarily predict targeting of proteins to unmethylated CGs. In the 
case of DNMT1, the CXXC domain seems to be used for proofreading’. 
A crucial question is whether specific binders exist for cytosine oxi- 
dation derivatives, which would suggest that these bases function as 
distinct signals. This is a topic of active research, and several candidate 
proteins have been proposed”*””*”. However, the low frequency of these 
modifications has so far stymied efforts to convincingly show selective 
binding in vivo”. It also remains to be determined whether methyla- 
tion of cytosines outside the CpG context, which seems to bea frequent 
event in embryonic stem cells, subsets of neurons” and oocytes”, is 
specifically recognized. 


REVIEW 


Genetic determination and environmental influence 

How much do our methylomes vary, and how can this variation be 
explained? Recent genome-wide association studies compared DNA 
sequence variation within human cohorts with changes in chromatin 
modifications’ or DNA methylation”. The data showed that genetic 
variation explains a large part of the observed changes in histone modi- 
fications or DNA methylation’. The likely explanation is that muta- 
tions within regulatory regions affect binding of transcription factors, 
which in turn influence DNA methylation and histone modifications 
(Fig. 4). Alternatively, a mutation in a transcription factor could impair 
its ability to bind a specific sequence, leading to increased methyla- 
tion. This transcription-factor dependency is similar to that observed 
during development, in which expression of cell-type-specific tran- 
scription factors coincides with reduced methylation of their binding 
sites (Fig. 4). These genome-wide correlations are also supported by 
gene-insertion experiments showing that local DNA sequence can be 
the primary determinant of DNA methylation state””’. Indeed a role 
for transcription factors in establishing DNA methylation states was 
observed 20 years ago™””**. Thus several lines of evidence reinforce 
the notion that the underlying DNA sequence, through its recognition 
by transcription factors, seems to account for a substantial part of the 
observed DNA methylation and chromatin patterns and their variation 
between individuals or cell types. 

How transcription factors influence chromatin and DNA methylation 
and which factors are instructive or sensitive to chromatin states remains 
largely unexplored (Fig. 2). These are crucial questions, because under- 
standing this genetic and epigenetic crosstalk will be essential to correctly 
interpret and assign function to patterns of DNA methylation in develop- 
ment and disease. It should help in identifying sites that are more suscep- 
tible to sporadic change and environmental exposure. Indeed, differences 
in DNA methylation are observed in isogenic subjects, such as identical 
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Figure 3 | Setting, erasing and recognizing cytosine methylation. a, Different 
methylation states of the CG dinucleotide and the enzymatic pathways that set, 
maintain and erase the mark. The pathways leading from the oxidized forms 

to the unmethylated state are under debate”. DNMT, DNA methyltransferase; 
TDG, thymine-DNA glycosylase. b, A subset of CXXC-domain-containing 
proteins are listed that can specifically bind to the unmethylated CG 
dinucleotide and could potentially reinforce the unmethylated state or recruit 
regulatory proteins to unmethylated CG islands. Methyl-CpG-binding domain 
(MBD) proteins specifically bind to the methylated CG with little or no further 
sequence sensitivity, potentially mediating transcriptional repression, which 
would be strongest in methylated CG islands. Readers of oxidized forms are not 
shown owing to ongoing debate about proposed candidates”. 
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Figure 4 | Potential DNA sequence determinants of cytosine methylation at 
CG-poor regions. Ina simplified model, transcription-factor binding causes 
reduced methylation at its binding site. Loss of expression of the respective 
transcription factor in development or disease will cause increased methylation. 
Mutations to the transcription factor that affect its binding preference will 
influence genomic methylation patterns. Mutation in the DNA binding site will 
abolish binding even in a cell expressing the transcription factor, indicating how 
genetic variation can result in methylation differences between individuals. 


twins, but the actual level is under debate”. Furthermore, developing 
mouse embryos display methylation changes after severe undernutrition 
in utero”. The question remains as to whether these differences occur 
downstream of perturbed gene activity or are causally involved in the 
resulting phenotype. In the case of the undernutrition model, the meta- 
bolic phenotype can even propagate across generations, despite the lack 
of differences in DNA methylation’. Generally, although a role for DNA 
methylation in trans-generational inheritance is clearly appealing, there 
is limited evidence for this pathway in mammals (reviewed in ref. 101). 
At the same time, stably inherited epialleles clearly exist in plants'”’. One 
explanation for this difference is that mammals show waves of DNA- 
methylation gain and loss in the germ line and early development, leading 
to the erasure of most acquired methylation. This important aspect of 
DNA methylation biology is not the focus of this Review and has recently 
been discussed in detail’?”***, 


New disease links 

Perturbations of DNA methylation patterns are frequently observed in 
disease, particularly in cancer. These perturbations include methylation 
of CGI promoters for tumour suppressor genes, implying a functional 
role®. At the time of the initial observations, support for this model in 
the form of recurring mutations, in tumours, of proteins that regulate 
DNA methylation was missing. This changed with the discovery of 
the enzymatic activity of TET proteins — TET2 mutations have been 
linked to many myeloid malignancies'*”'”’ — and through the obser- 
vation that DNMT3< is one of the most frequently mutated genes in 
acute myeloid leukaemia (AML) (mutated in 25% of adults with the 
disease)'™. Although the actual mechanism in both cases remains to be 
determined, it is striking that TET2 and DNMT3A loss of function seem 
to be primary events. Mutations in DNMT3A are pre-leukaemic and 
can occur in blood stem cells, but only in combination with subsequent 
‘driver mutations’ do they lead to leukaemia’. 

In fact, recurrent mutations of many genes encoding chromatin 
components have now been identified in diverse cancer types’”. It 
is unclear how these proteins actually contribute to disease, but one 
hypothesis is that chromatin perturbations make the regulatory land- 
scape more vulnerable to subsequent mutations’”’. Such concepts of 
genetic buffering by chromatin pathways have already been proposed 
based on genetic interaction screens between hypomorphic mutants, 
for example in C. elegans'”*. In this study, mutations in chromatin 
modifiers enhanced the phenotype of mutations in many different 
transcription factors. One interpretation of these findings is that 
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chromatin pathways globally contribute to the stability of transcrip- 
tional regulation rather than directing activity to specific genes. It is 
tempting to speculate that DNA methylation might also contribute 
in this way to genome regulation. 


Utility of DNA methylation 

Although the function of DNA methylation at CG-poor regulatory 
regions remains unclear, it is evident that patterns of this mark have high 
information content about the ongoing activity of transcription factors 
and, thus, can help to identify cell-type-specific aspects of gene regula- 
tion”. For this reason, DNA methylation profiles can provide insight into 
many aspects of biology, but also function as biomarkers in medicine. It 
is important to keep in mind that the value of a biomarker is only defined 
by its ability to predict disease state or treatment response regardless of 
whether the measured parameter is causally involved in the disease. Part 
of the appeal of DNA methylation is the feasibility of the analysis. Bisul- 
phite conversion of unmethylated cytosines followed by DNA sequencing 
requires no live cells and limited amounts of DNA, which can even below 
quality, making DNA methylation less sensitive to specimen handling 
compared with RNA or proteins””’. Asa result, bisulphite sequencing, 
which, importantly, cannot distinguish between 5mC and S5hmC, can 
even be conducted on DNA isolated from small amounts of fixed tissue". 
Any primary sample from which DNA can be sequenced should be suit- 
able for bisulphite sequencing, a technique readily performed in any labo- 
ratory that is set-up for genome sequencing. Given that technical issues do 
not represent a significant hurdle, the crucial question is how much medi- 
cally relevant information can be obtained from such DNA methylation 
analysis. Potential pitfalls include cellular heterogeneity within disease 
samples, confounding genetic variation between individuals or intrinsic 
variability, which could limit the utility of methylation measurements. 

Nevertheless, in the case of medulloblastoma, a detailed analysis of 
methylation levels in regulatory regions enabled identification of disease- 
and tumour-subtype-specific changes'”. These changes reflect not only 
transcription-factor activity but also putative new markers for disease 
states'””. The aforementioned study adds to a list of examples for which 
DNA methylation can assist in identifying tumour subtypes”. 

Moreover, in most disease settings, cells from the affected tissue can- 
not be easily obtained, and it is questionable whether those that are 
available will be informative (for example, blood cells). Several studies 
have now shown that non-affected tissue can be informative'’’, even 
leading to treatment decisions’ and stratification of clinical cohorts’. 
Further insights into the functionality of DNA methylation will come 
from ongoing large cohort studies, leading towards a more comprehen- 
sive view of whether and how disease phenotypes can be associated with 
specific DNA methylation patterns''®. The link between gene activity 
and DNA methylation, as well as the accessibility of this base modifica- 
tion, justify further exploration. One intriguing possibility is that DNA 
methylation might be prognostic of disease onset or personal risk in 
combination with genetic predisposition and environmental exposure. 

The potential use of DNA methylation is not limited to biomedical 
applications. Measurements at the resolution of individual molecules 
and cells promise to substantially advance our understanding of regula- 
tion in biology. They provide more quantitative information and reveal 
heterogeneity within the cell population or dynamic changes at the level 
of individual cells'””. Bisulphite conversion followed by high-throughput 
sequencing represents a single-molecule measure and has already been 
successfully applied in stem-cell biology to define population heterogene- 
ityand the degree of pluripotency after induced reprogramming'*’”. For 
example, subpopulations of mouse embryonic stem cells that differ in the 
level of the transcription factor Rex1 can be separated by their DNA meth- 
ylation patterns’”’. Based on chemical inhibition, DNA methylation was 
argued to directly contribute to the switch between these populations’”’. 
Studies that tackle questions of cell identity and origin will further ben- 
efit from the recent development of approaches that allow genome-wide 
measurements at the level of single cells’”’. 

Tracking methylation patterns over time can also reveal insights into 
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the fidelity of DNA methylation maintenance. A comparison of the het- 
erogeneity of DNA methylation at specific genomic loci suggested that 
human stem cells in culture undergo high turnover of DNA methylation. 
By contrast, fibroblast cells seem to show more stable inheritance of fre- 
quent sporadic changes, implying that somatic cells show more faithful 
maintenance and thus pass on sporadic changes in DNA methylation”. 
If the same applies in vivo, high-coverage DNA methylation measure- 
ments from single cells should provide information on lineage trees. This 
approach, when applied to an in vitro model of cellular transformation, 
has revealed insights into the kinetics of progressive changes in regions 
that are commonly hypermethylated in cancer’. Although it remains to 
be seen whether this principle is relevant to carcinogenesis in vivo, it high- 
lights how DNA methylation can be used to track cellular states over time. 


Outlook 

The emerging picture is that genomic DNA methylation in mammals 
reflects, to a large extent, cell-intrinsic regulation encoded within the 
DNA sequence in the form of CG density and binding motifs for tran- 
scription factors. This genetic dependency is very reminiscent of histone 
modifications, which are also an integral part of the activation of regula- 
tory regions and the process of transcription. It explains why chromatin 
and DNA modifications are informative indicators of underlying regula- 
tory activity, but it does not reveal what their actual impact is on the regu- 
lation of individual genes. This is likely to be highly contextual at the level 
of DNA sequence and binding factors, as shown by the efficient repression 
of CGIs by DNA methylation, which might represent the exception rather 
than the rule because dynamics in DNA methylation occur elsewhere in 
the genome. This exemplifies our limited understanding of how local 
levels of DNA methylations are set, turned over by demethylation and, 
in turn, recognized by transacting factors. A better understanding of this 
regulation is required to define these potential functions and determine 
how misregulation contributes to disease. m 
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genetic diversity is needed. The African Genome Variation Project provides a resource with which to design, implement 
and interpret genomic studies in sub-Saharan Africa and worldwide. The African Genome Variation Project represents 
dense genotypes from 1,481 individuals and whole-genome sequences from 320 individuals across sub-Saharan Africa. 
Using this resource, we find novel evidence of complex, regionally distinct hunter-gatherer and Eurasian admixture 
across sub-Saharan Africa. We identify new loci under selection, including loci related to malaria susceptibility and 
hypertension. We show that modern imputation panels (sets of reference genotypes from which unobserved or missing 
genotypes in study sets can be inferred) can identify association signals at highly differentiated loci across populations in 
sub-Saharan Africa. Using whole-genome sequencing, we demonstrate further improvements in imputation accuracy, 
strengthening the case for large-scale sequencing efforts of diverse African haplotypes. Finally, we present an efficient 
genotype array design capturing common genetic variation in Africa. 


Globally, human populations show structured genetic diversity as a 
result of geographical dispersion, selection and drift. Understanding 
this variation can provide insights into evolutionary processes that 
shape both human adaptation and variation in disease susceptibility’. 
Although the Hapmap Project” and the 1000 Genomes Project’ have 
greatly enhanced our understanding of genetic variation globally, the 
characterization of African populations remains limited. Other efforts 
examining African genetic diversity have been limited by variant density 
and sample sizes in individual populations’, or have focused on isolated 
groups, such as hunter gatherers (HG)°*, limiting relevance to more 
widespread populations across Africa. 

The African Genome Variation Project (AGVP) is an international 
collaboration that expands on these efforts by systematically assessing 
genetic diversity among 1,481 individuals from 18 ethno-linguistic groups 
from sub-Saharan Africa (SSA) (Fig. 1 and Supplementary Methods 
Tables 1 and 2) with the HumanOmni2.5M genotyping array and whole- 
genome sequences (WGS) from 320 individuals (Supplementary 


Methods Table 2). Importantly, the AGVP has evolved to help develop 
local resources for public health and genomic research, including strength- 
ening research capacity, training, and collaboration across the region. 
We envisage that data from this project will provide a global resource 
for researchers, as well as facilitate genetic studies in Africa’. 


Population structure in SSA 


On examining ~2.2 million variants, we found modest differentiation 
among SSA populations (mean pairwise F;; 0.019) (Supplementary 
Methods and Supplementary Table 1). Differentiation among the Niger- 
Congo language groups—the predominant linguistic grouping across 
Africa was noted to be modest (mean pairwise Fs; 0.009) (Supplemen- 
tary Table 1), providing evidence for the ‘Bantu expansion’—a recent 
population expansion and movement throughout SSA originating in 
West Africa around 3,000 to 5,000 years ago’. 

We identified 29.8 million single-nucleotide polymorphisms (SNPs) 
from Ethiopian, Zulu and Bagandan WGS (Extended Data Fig. 1 and 
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Figure 1 | Populations studied in the AGVP. a, 18 African populations 
studied in the AGVP including 2 populations from the 1000 Genomes Project. 
(The term ‘Ethiopia’ encompasses the Oromo, Amhara and Somali ethno- 
linguistic groups.) b, c, ADMIXTURE analysis of these 18 populations alone 
(n = 1,481) (b) and ina global context (n = 3,904) (c). Each colour represents a 
different ancestral cluster, with clusters 2-6 represented along the y-axis in b 


Supplementary Methods). A substantial proportion of unshared (11%- 
23%) and novel (16%-24%) variants were observed, with the highest 
proportion among Ethiopian populations (Extended Data Fig. 1). The 
high proportion of unshared variation among populations recapitulates 
the need for large-scale sequencing across Africa, including among ge- 
netically divergent populations. 

We used principal component analysis to explore relationships among 
AGVP populations (Extended Data Figs 2-5, Supplementary Figs 1 and 2). 
PC] appeared to representa cline extending from West and East African 
populations towards Ethiopian populations, possibly suggesting Eur- 
asian gene flow, while PC2 separated West African and South/East A fri- 
can populations (Extended Data Fig. 2). Inclusion of the 1000 Genomes 
Project, North African and Khoe-San (Khoisan) populations in principal 
component analysis (Extended Data Figs 3-5, and Supplementary Figs 1 
and 2) suggested possible HG ancestry among southern Niger-Congo 
groups—highlighted by clustering towards the Khoe-San, in addition 
to confirming a cline towards Eurasian populations. ‘Unsupervised’ 
(that is, without including known information on individual ancestry) 
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and clusters 2-18 represented in c. K = 6 and K = 18 were the most likely 
clusters on ADMIXTURE analysis. ADMIXTURE analysis suggests 
substructure between North, East, West and South Africa. Studying these 
populations in the context of Eurasian and African HG populations suggest 
extensive Eurasian and HG admixture across Africa. 


ADMIXTURE? (https://www.genetics.ucla.edu/software/admixture/) 
analysis including the 1000 Genomes Project and Human Origins data 
sets (Fig. 1), also supported evidence for substantial Eurasian and HG 
ancestry in SSA (Fig. 1 and Extended Data Fig. 6). 

To assess the effect of gene flow on population differentiation in 
SSA, we masked Eurasian ancestry across the genome (Supplementary 
Methods and Supplementary Note 6). This markedly reduced population 
differentiation, as measured by a decline in mean pairwise Fsy from 0.021 
to 0.015 (Supplementary Note 6), suggests that Eurasian ancestry has 
a substantial impact on differentiation among SSA populations. We 
speculate that residual differentiation between Ethiopian and other SSA 
populations after masking Eurasian ancestry (pairwise Fs = 0.027) may 
be a remnant of East African diversity pre-dating the Bantu expansion”. 


Population admixture in SSA 
Formal tests for admixture (the three population test or f3 statistic)’’, 


confirmed widespread Eurasian and HG admixture in SSA (Supplemen- 
tary Tables 2 and 3). Quantification of admixture (Supplementary Table 4, 
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Figure 2 | Dating and proportion of Eurasian and HG admixture among 
African populations. The proportion and distribution of Eurasian and 
HG admixture among different populations across Africa, with approximate 


Supplementary Methods and Supplementary Notes 3 and 4) indicated 
substantial Eurasian ancestry in many African populations (ranging from 
0% to 50%), with the greatest proportion in East Africa (Fig. 2 and Sup- 
plementary Table 4). Similarly, HG admixture ranged from 0% to 23%, 
being greatest among Zulu and Sotho (Fig. 2 and Supplementary Table 5). 

We found evidence for historically complex and regionally distinct 
admixture with multiple HG and Eurasian populations across SSA (Fig. 2 
and Supplementary Note 5). Specifically, ancient Eurasian admixture 
was observed in central West African populations (Yoruba; ~7,500- 
10,500 years ago), old admixture among Ethiopian populations (~2,400- 
3,200 years ago) consistent with previous reports’®’”, and more recent 
complex admixture in some East African populations (~ 150-1,500 years 
ago) (Fig. 2, Extended Data Fig. 7 and Supplementary Note 5). Our find- 
ing of ancient Eurasian admixture corroborates findings of non-zero 
Neanderthal ancestry in Yoruba, which is likely to have been introduced 
through Eurasian admixture and back migration, possibly facilitated 
by greening of the Sahara desert during this period'>™. 

Wealso find evidence for complex and regionally distinct HG admix- 
ture across SSA (Fig. 2, Extended Data Figs 7 and Supplementary Note 5), 
with ancient gene flow (~9,000 years ago) among Igbo and more recent 
admixture in East and South Africa (multiple events ranging from 
100 years ago to 3,000 years ago), broadly consistent with historical move- 
ments reflecting the Bantu expansion. An exploration of the likeliest 
sources of admixture in our data suggested that HG admixture in Igbo 
was most closely represented by modern day Khoe-San populations 
rather than by rainforest HG populations (Supplementary Note 5). 
Given limited archaeological and linguistic evidence for the presence 
of Khoe-San populations in West Africa, this extant HG admixture 
might represent ancient populations, consistent with the presence of 
mass HG graves from the early Holocene period comprising skeletons 
with distinct morphological features’, and with evidence of HG rock 
art dating to this period in the western Sahara'®’”. In East Africa, our 
analyses suggested that Mbuti rainforest HG populations most closely 
represented ancient HG mixing populations (Supplementary Note 5), 
with admixture dating to ~3,000 years ago, suggesting that HG ancestry 
here is likely to be older than previously reported’*. The primary source 
of HG admixture in Zulu and Sotho populations was from Khoe-San 
populations (Fig. 2 and Supplementary Note 5), consistent with lin- 
guistic assimilation of click consonants among these populations. 


Positive selection in SSA 

We examined highly differentiated SNPs between European and African 
populations, as well as among African populations to gain insights into 
loci that may have undergone selection in response to local adaptive 
forces (Supplementary Methods). To account for confounding due to 


dating of admixture using MALDER (code was provided by J. Pickrell; see 
Supplementary Information). 


Eurasian admixture, we also conducted analyses after masking Eurasian 
ancestry (Supplementary Methods and Supplementary Note 6). 

On examining locus-specific Europe-Africa differentiation, enrich- 
ment of loci known to be under positive selection was observed among 
the most differentiated sites (P = 1.4 X 10 *'). Furthermore, there was 
statistically significant enrichment for gene variants among these, indi- 
cating that this differentiation is unlikely to have arisen purely from 
random drift (P = 0.0002). Additionally, we found no evidence for back- 
ground selection as the primary driver of differentiation among these 
loci (Supplementary Note 7). 

In addition to genes known to be under positive selection (for exam- 
ple, SLC24A5, SLC45A2 and OCA2"*”, LARGE”! and CYP3A4/5) (Sup- 
plementary Fig. 3), we found evidence of differentiation in novel gene 
regions, including one implicated in malaria (for chemokine receptor 1, 
CR1) (Extended Data Fig. 8). CR1 carries the Knops blood group anti- 
gens and has previously been implicated in malaria susceptibility” and 
severity”, with evidence suggesting positive selection in malaria-endemic 
regions~* (Extended Data Fig. 8). We also identified highly differentiated 
variants within genes involved in osmoregulation (ATP1A1 and AQP2) 
(Extended Data Fig. 8). Deregulation of AQP2 expression and loss-of- 
function mutations in ATP1A1 have been associated with essential and 
secondary hypertension, respectively~*”®. Climatic adaptive changes in 
these gene regions could potentially provide a biological basis for the 
high burden of hypertension and differences in salt sensitivity observed 
in SSA”. 

In contrast, overall differentiation among African populations was 
modest (maximum masked F,; = 0.19) (Supplementary Fig. 4) and only 
56/1,237 sites remained in the tail distribution after masking (Supplemen- 
tary Methods, Supplementary Table 6). This suggests that a large pro- 
portion of differentiation observed among African populations could 
be due to Eurasian admixture, rather than adaptation to selective forces 
(Supplementary Note 6). Genes known to be under selection were nota- 
bly enriched among the most differentiated loci after masking of Eur- 
asian ancestry (P = 2.3 X 10°). Among the 56 loci robust to Eurasian 
ancestry masking (Supplementary Table 6), we identified several loci 
known to be under selection (Extended Data Fig. 8), including a highly 
differentiated variant (rs1378940) in the CSK gene region implicated in 
hypertension in genome-wide association studies (GWAS)**. The major 
allele of rs1378940 among Africans was in complete linkage disequilib- 
rium with the risk allele of the GWAS SNP rs1378942 (ref. 29), with the 
frequency of this allele highly correlated with latitude (r = —0.67), pro- 
viding support for local adaptation in response to temperature as a pos- 
sible mechanism for hypertension (Supplementary Fig. 5)°°??. 

Comparing populations residing in endemic and non-endemic infec- 
tious disease regions (Supplementary Methods), we identified several 
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loci associated with infectious disease susceptibility and severity. As well 
as the known sickle-cell locus related to malaria, this approach iden- 
tified additional signals for genes potentially under selection, including 
the PKLR region”, RUNX3”*, the haptoglobin locus, CD163”, IL10°>”*, 
CFH, and the CD28-ICOS-CLTA4 locus (Supplementary Table 7 and 
Extended Data Fig. 8)°’. Similar comparisons for Lassa fever identified 
the known LARGE gene, as well as candidates associated with viral entry 
and immune response, including in the Histocompatibility Leukocyte 
Antigen region, DC-SIGN/DC-SIGNR* (also known as CD209/CLEC4M), 
RNASEL, CXCR6, IFIH1* and OAS2/3 regions (Supplementary Table 7). 
For trypanosomiasis, we identified APOL1 40" as well as several loci im- 
plicated in immune response and binding to trypanosoma, including 
FAS, FASLG*"**, IL23R*, SIGLEC6 and SIGLEC12 (Supplementary 
Table 7)**. For trachoma, we identified signals in ABCA1 and CXCR6, 
which may be important for the growth of the parasite and host immune 
response, respectively (Supplementary Table 7)**. 


Designing medical genetics studies in Africa 


To inform the design of genomic studies in Africa, we addressed the 
following questions: (1) How well do current genotype arrays perform 
in African populations using existing reference panels for imputation? 
(2) Can these genotype arrays and reference panels identify and fine- 
map association signals in populations across Africa? (3) Can we improve 
imputation accuracy in African populations using a new African refer- 
ence panel? and (4) What are the most cost-effective designs for large- 
scale GWAS in Africa? 

The 1000 Genomes Project phase I integrated panel provided rea- 
sonably accurate imputation into the Illumina Omni 2.5M array in all 
populations (Supplementary Note 10). However, imputation accuracy 
was lower among Sotho, Zulu and Afro-Asiatic populations, possibly 
reflecting poor representation of some African haplotypes (including 
Khoe-San haplotypes) within the 1000 Genomes Project panel. These 
findings suggest that improvements in imputation accuracy across diverse 
population groups may require larger and more diverse reference panels. 

Weassessed the reproducibility and potential for fine-mapping asso- 
ciation signals within Africa and globally at several disease susceptibility 
loci (Supplementary Methods, Supplementary Table 8 and Extended 
Data Fig. 9). Current genotype arrays and imputation panels allowed 
for identification of relevant association signals at most loci across pop- 
ulations in SSA, demonstrating that association signals are reproducible 
across populations in SSA (Extended Data Fig. 9 and Supplementary 
Figs 7-18). African populations are likely to provide better fine-mapping 
resolution around the causal locus (Supplementary Table 8). We high- 
light one example here: the sickle-cell anaemia locus (HBB)”, which is 
under positive selection owing to the protection the sickle cells confer 
against severe malaria. This locus showed marked heterogeneity in 
association signals across populations, reflecting different linkage dis- 
equilibrium patterns and allele frequencies among populations in SSA 
(Supplementary Figs 9 and 10). This pattern is probably the result of 
independent selection sweeps at this locus in different parts of Africa, 
leading to differences in hitchhiking rare haplotypes that attained high 
frequencies among different populations”. This suggests that these sig- 
natures are recent and occurred during or after the Bantu expansion, 
consistent with the hypothesis that the advent of agriculture and increased 
malaria transmission may have resulted in increased selection pressure”. 
However, in contrast to previous reports*’, we show that association 
signals even at such highly differentiated loci can be captured with dense 
genotype data using existing reference panels for imputation, despite 
individual population groups not being fully represented in these. This 
suggests that, instead of large-scale population-specific sequencing across 
Africa, what is needed is a broad sequencing approach, targeted at cap- 
turing widespread haplotype diversity. 

To assess the utility of a larger and more diverse African reference 
panel for imputation, we generated a panel integrating the 1000 Genomes 
Project phase I and AGVP WGS panels (Supplementary Methods and 
Supplementary Note 9). Using this integrated panel, we observed marked 
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Figure 3 | Improvement in imputation accuracy with the AGVP WGS 
panel. The substantial improvement in imputation accuracy in some 
populations (Sotho), compared to minimal improvement in others (Igbo) with 
the addition of the AGVP WGS reference panel to the 1000 Genomes Project 
phase I reference panel (‘merged’) suggests poor representation of some 
haplotypes (for example, Khoe-San haplotypes in Sotho) in the 1000 Genomes 
Project reference panel alone (‘1000’). r is the correlation coefficient, 
representing the correlation between imputed and genotyped data, on masking 
each genotyped variant during imputation. MAF, minor allele frequency. 


improvements in imputation accuracy across the whole range of the 
allele frequency spectrum in specific populations poorly represented 
by the 1000 Genomes Project panel (Fig. 3 and Supplementary Note 11). 
These findings suggest that even common haplotypes in some SSA pop- 
ulations may not be sufficiently captured by existing panels, limiting 
our power to examine associations of common variants with disease. 
Importantly, given the specificity of the improvement in imputation 
accuracy, we infer that targeted sequencing of divergent populations 
representing a broad spectrum of haplotypes across Africa, including 
HG and North/East African haplotypes, rather than widespread popu- 
lation sequencing is likely to provide a more efficient strategy to improve 
imputation accuracy and a practicable GWAS framework in Africa. 

Wecompared the utility of existing chip designs (2.5M Illumina) and 
ultralow-coverage WGS designs (0.5, 1X, 2X coverage) to determine 
the optimal design for African GWAS. Sensitivity for common varia- 
tion was >90% at all sequencing depths (Supplementary Note 12). Exam- 
ining the effective sample size for a fixed budget”, we found the effective 
sample size was greater for all ultralow-coverage WGS and chip array 
designs compared with 4x WGS. When computational costs were 
accounted for (Supplementary Note 12), the HumanOmni2.5M array 
provided the greatest effective sample size supporting the development 
and large-scale use of efficient genotype arrays in Africa, where these 
have been underutilized. 

We therefore sought to evaluate a potential chip design to tag common 
variation across a wider range of African populations (Supplementary 
Note 13). Importantly, we show that an array with one million genetic 
variants could capture >80% of common variation (minor allele fre- 
quency >5%) across the genome (Extended Data Fig. 10). These ana- 
lyses suggest that designing a pan-African genotype array to effectively 
capture common genetic variation across Africa is feasible, and could 
greatly facilitate large-scale genomic studies in Africa. 


Discussion 


The marked haplotype diversity within Africa has important implica- 
tions for the design of large-scale medical genomics studies across the 
region, as well as studies of population history and evolution. In this 
context, the AGVP is a resource that will facilitate a broad range of 
genomic studies in Africa and globally. 

Although Africa is the most genetically diverse region in the world, 
we provide evidence for relatively modest differentiation among pop- 
ulations representing the major sub-populations in SSA, consistent with 
recent population movement and expansion across the region begin- 
ning around 5,000 years ago—the Bantu expansion’. Although the history 
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of the Bantu expansion is probably complex, assessments of population 
admixture can provide new insights. We note historically complex and 
regionally distinct admixture with multiple HG and Eurasian popula- 
tions across SSA, including ancient HG and Eurasian ancestry in West 
and East Africa and more recent complex HG admixture in South Africa. 
As well as explaining genetic differentiation among modern populations 
in SSA, these admixture patterns provide genetic evidence for early 
back-to-Africa migrations, the possible existence of extant HG popu- 
lations in western Africa—compatible with archaeological evidence’, 
and patterns of gene flow consistent with the Bantu expansion, includ- 
ing genetic assimilation of populations resident across the region. 

This admixture also has important implications for the assessment 
of differentiation and positive selection in Africa. Accounting for these 
elements, we have identified loci under positive selection that are linked 
with hypertension, malaria, and other pathogens. This provides a proof- 
of-concept for the ability of geographically widespread genetic data within 
Africa to identify loci under selection related to diverse environments. 

Our evidence for the broad transferability of genetic association sig- 
nals and their statistical refinement has important implications for 
medical genetic research in Africa. Importantly, we highlight that such 
studies are feasible and can be enabled through the development of 
more efficient genotype arrays and diverse WGS reference panels for 
accurate imputation of common variation. In this context, we describe 
a framework for a new pan-African genotype array that could directly 
facilitate large-scale genomic studies in Africa. 

A critical next step is the large-scale deep sequencing of multiple and 
diverse populations across Africa, which should be integrated with ancient 
DNA data. This would enable us to identify and understand signals of 
ancient admixture, patterns of historical population movements, and to 
provide a comprehensive resource for medical genomic studies in Africa. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extended Data Figure 1 | Allele sharing between sequenced populations in 
the AGVP. a, The overlap of SNPs between 4X WGS data from Zulu, Ugandan 
and Ethiopian individuals (subsampled to 100 samples each). b, The overlap 
of novel variants (those not in the 1000 Genomes Project phase I integrated 
call set, ‘1000G’) between the three populations. c, d, The allele frequency 
spectra of variants in different portions of the Venn diagrams depicted in a and 
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b, respectively. There appear to be a large proportion of unshared (private) 
variants in each population: between 10% and 23% of the total number of 
variants in a given population. The proportion of novel variants was high, with 
Ethiopia showing the greatest proportion of novel variation. Most of the novel 
variation appears to be unshared and rare. 
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data set. PC1 shows a cline among several African populations, most likely to structure between, and within African populations. 
represent Eurasian gene flow (n = 1,481). PC2 shows a clear separation 
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Extended Data Figure 3 | The first ten principal components for the global 
data set, including populations from the 1000 Genomes Project. PC1 shows 
a cline among several African populations extending towards European 
populations, most likely to represent non-SSA gene flow (n = 2,864). PC2 
shows a clear separation between European and Asian populations. Subsequent 
PCs show more detailed structure between populations globally, and within 
African populations. GBR, British in England and Scotland; ACB, African 
Caribbeans in Barbados; ASW, Americans of African ancestry in southwestern 
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USA; CDX, Chinese Dai in Xishuangbanna, China; CEU, Utah residents with 
Northern and Western European ancestry; CHB, Han Chinese in Beijing, 
China; CHS, Southern Han Chinese; CLM, Colombians from Medellin, 
Colombia; FIN, Finnish in Finland; GIH, Gujarati Indian from Houston, Texas, 
USA; IBS, Iberian population in Spain; JPT, Japanese in Tokyo, Japan; KHV, 
Kinh in Ho Chi Minh City, Vietnam; MXL, Mexican ancestry from Los 
Angeles, USA; PEL, Peruvians from Lima, Peru; PUR, Puerto Ricans from 
Puerto Rico, and TSI, Toscani in Italy. 
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Extended Data Figure 4 | The first ten principal components for the 
global extended data set, including populations from the 1000 Genomes 
Project, Human Genome Diversity Project, North African and Khoe-San 
population groups. PC1 shows a cline among several African populations 


extending towards European populations, most likely to represent non-SSA 
gene flow (n = 3,202). PC2 shows a clear separation between European and 
Asian populations. Subsequent principal components show more detailed 
structure between populations globally, and within African populations. 
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Extended Data Figure 5 | Projection of principal components to assess 
admixture among African populations. a, The projection of principal 
components calculated on YRI and CEU from the 1000 Genomes Project onto 
the African populations. The AGVP populations are seen to fall on a cline 
between YRI and CEU, with Ethiopian populations closest to CEU. This is 
suggestive of Eurasian ancestry among these populations. b, The projection of 
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principal components calculated on YRI and Ju/*hoansi onto the AGVP and 
other Khoe-San populations. The AGVP and Khoe-San populations are seen to 
fall on a cline between YRI and Ju/hoansi, with Zulu and Sotho leading the 
cline among the AGVP populations. This is suggestive of HG gene flow among 


these populations. 
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Diversity Project, North African and Khoe-San samples. Cluster K = 2 of East, West, North and South African ancestral components n = 3,202. 
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confidence intervals for different AGVP populations estimated with MALDER admixture estimated with MALDER for the same populations using 
(see Supplementary Note 5). Circular markers with a line drawn around them __ high-quality imputed data to improve resolution. 
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a) 


Extended Data Figure 8 | Loci with marked allelic differentiation either 
globally or within Africa. The derived and ancestral alleles are depicted in blue 
and red, respectively, for all loci. a, The global distribution of the non- 
synonymous variant rs17047661 at the CR1 locus implicated in malaria 
severity. This locus was noted to be among the most differentiated sites (in the 
top 0.1%) between Europe and Africa. b, The global distribution of the 
1810216063 SNP at the AQP2 locus. The derived allele appears to be the major 
allele among European populations in contrast to African populations. c, The 
allele frequency distribution of rs10924081 at the ATP1A1 locus. Marked 


differentiation is observed globally, with the derived allele noted to be the major 
allele among European populations. d, The global distribution of the risk allele 
for the SNP 1s1378940 in the CSK locus associated with hypertension. This 
locus was found to be within the top 0.1% of differentiated loci within Africa, 
and within the top 1% of differentiated loci globally. e, The allele frequency 
distribution of the rs3213419 SNP at the HP locus. f, The allele frequency 
distribution of the rs7313726 SNP at the CD163 locus. The HP and CD163 are 
among the top 0.1% of differentiated sites between malaria endemic and 
non-endemic regions in Africa. 
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frequency of the causal allele A. b, The distribution of the SORT1 causal SNP 
1312740374, with the derived allele T depicted in blue. c-f, The distributions 
of the APOLI variant rs73885319, TCF7L2 variant rs7903146, the APOE 
variant rs429358 and the PRDM9 variant rs6889665, respectively. 


Extended Data Figure 9 | The global distribution of biologically relevant 
loci used for simulation of traits to examine reproducibility of signals across 
AGVP populations. a, The frequency of the sickle-cell variant (rs334) in 
different regions globally. The blue portion of each pie chart represents the 
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Extended Data Figure 10 | The coverage obtained across the genome for 
variants at different allele frequencies for a hypothetical African genotype 
array with one million tagging variants. Different allele frequency bins are 
depicted in different colours. The lines show the coverage that can be achieved 
by imputation at different r thresholds. Coverage, here, is defined as the 
proportion of variants within an allele frequency captured above a pre-defined 
r threshold (along the x axis) after imputation. The solid lines represent the 


0.6 0.8 1.0 
2 


r 
coverage obtained with one million variants selected using the hybrid tagging 
and imputation approach, while the broken lines represent the coverage 
obtained by using a simple pairwise tagging approach to capture one million 
tagging variants. The hybrid method improves the coverage obtained, 
particularly for common variation. Coverage for common variants (>5%) 
appears to be high at an 1° threshold of 0.8 and above, with >80% of these 
variants accurately imputed. 
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Internal models direct dragonfly 
interception steering 


Matteo Mischiati'*, Huai-Ti Lin!*, Paul Herold', Elliot Imler?, Robert Olberg? & Anthony Leonardo! 


Sensorimotor control in vertebrates relies on internal models. When extending an arm to reach for an object, the brain 
uses predictive models of both limb dynamics and target properties. Whether invertebrates use such models remains 
unclear. Here we examine to what extent prey interception by dragonflies (Plathemis lydia), a behaviour analogous to 
targeted reaching, requires internal models. By simultaneously tracking the position and orientation of a dragonfly’s 
head and body during flight, we provide evidence that interception steering is driven by forward and inverse models of 
dragonfly body dynamics and by models of prey motion. Predictive rotations of the dragonfly’s head continuously track 
the prey’s angular position. The head-body angles established by prey tracking appear to guide systematic rotations of 
the dragonfly’s body to align it with the prey’s flight path. Model-driven control thus underlies the bulk of interception 
steering manoeuvres, while vision is used for reactions to unexpected prey movements. These findings illuminate the 
computational sophistication with which insects construct behaviour. 


Prediction and planning, essential to the high-performance control of 
behaviour, require internal models’. Decades of work in humans and 
non-human primates have provided evidence for three types of internal 
models that are fundamental to sensorimotor control: physical models 
to predict properties of the world”; inverse models to generate the motor 
commands needed to attain desired sensory states*; and forward models 
to predict the sensory consequences of self-movement”®. Without such 
model-driven control, sensory latencies would leave us grasping at things 
that had already moved and motor complexity would leave us fumbling 
with our limbs in the wrong configuration. However, while even basic 
reaching actions in vertebrates’ require predictive control, it is unknown 
whether insects rely on internal models to guide actions”*. 

Prey interception is a behaviour that is common to many insects, and 
is functionally similar to reaching. Because prey make unexpected turns, 
the predator’s interception course cannot be pre-planned, and the con- 
trol of steering has been viewed as purely reactive. Dragonflies”"’ in 
particular are thought to use a classic interception strategy known as 
parallel navigation’*’* (Fig. 1a). In parallel navigation, the pursuer steers 
so as to steadily reduce the length of the range vector from pursuer to 
prey while holding its direction constant. When prey angular velocity 
rotates the range vector, it elicits a reaction from the pursuer to stabilize 
the range vector direction (Fig. 1b). The appeal of parallel navigation is 
that it is time optimal’*, computationally simple’’, and requires no inter- 
nal models of the body or world. However, the movements forming the 
interception trajectory have a speed, accuracy and complexity” rival- 
ling those of vertebrate reaching tasks”'®. This suggests that either the 
dragonfly’s reflexes and manoeuvrability are exquisitely honed, or that 
steering is not purely reactive but is instead guided by model-driven 
control. 

We thus sought to determine whether dragonfly prey interception is 
dependent on internal models. We constructed a flight arena in which 
we could elicit interception flights to computer-controlled prey (Extended 
Data Fig. 1), and used microscopic retroreflective markers attached to 
the dragonfly, along with motion-capture techniques, to determine what 


the dragonfly saw and how its body moved during flight. The timing and 
structure of these head and body movements revealed that they were 
generated by predictions from forward, inverse and target models, com- 
bined with visual reactions to unexpected prey manoeuvring. These 
internal models provide the computational power to simultaneously 
estimate prey motion and determine the motor commands needed to 
compose an interception trajectory satisfying biomechanical constraints. 


Reactive steering to prey angular velocity 


We began our investigation of model-driven control in insect beha- 
viour by evaluating the accuracy with which simpler reactive strategies 
explain dragonfly interception steering. The basis for the parallel nav- 
igation hypothesis in dragonflies’ is the observation that their flight 
paths hold the range vector to the prey at roughly constant angle (Fig. 1c). 
Weassessed the fit of parallel navigation by examining the range vector 
and its derivative—parallel navigation requires them to be exactly anti- 
correlated, pointing in opposite directions, so that their sum yields a 
smaller range vector with the same direction. In the absence of prey 
manoeuvring, a pursuer implementing parallel navigation will quickly 
drive the range vector correlation to — 1. We calculated the range vector 
correlation for two data sets: interception flights to fruitflies (Supplemen- 
tary Videos 1 and 2), and interception flights to computer-controlled 
constant-velocity artificial prey (2 mm bead, speed 0.2-1.5ms_'; Sup- 
plementary Videos 3 and 4, Methods). The former test whether parallel 
navigation can be attained with manoeuvring prey that perturb the 
range vector direction, whereas the latter test how much faster parallel 
navigation can be attained with non-manoeuvring prey that never dis- 
turb the range vector. In both cases we found that parallel navigation 
(that is, correlations close to —1) occurred reliably only near capture 
(Fig. 1f, g). Notably, the mean time evolution of the range vector cor- 
relation for constant-speed prey was strikingly similar to that of real 
prey. Furthermore, the geometry of many interceptions approached 
parallel navigation and then deviated from it (Fig. 1g). As constant- 
speed prey do not manoeuvre, these deviations must have arisen from 
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non-parallel-navigation manoeuvring by the dragonfly, and therefore 
rule out parallel navigation as the steering strategy underlying dragon- 
fly interception. 

We excluded other reactive steering strategies based on prey angu- 
lar velocity by evaluating the general prediction of such models that 
every significant prey manoeuvre is rapidly met by a corrective man- 
oeuvre from the pursuer (Extended Data Fig. 2a). To look for such a 
temporal correlation between prey and pursuer steering, we identified 
the timing of the largest dragonfly and prey steering events as peaks in 
their acceleration (Fig. 1d and Methods). We then compared the dis- 
tribution of time lags between pairs of prey and dragonfly steering events 
toacontrol distribution that assumed steering events were random and 


« Artificial prey path 

—» Range vector 
Dragonfly: 

-© Flight path 

— Head gaze 

— Longitudinal body 
axis 


Figure 2 | Free-flight measurement of head and body orientation during 
prey interception. a, Dragonfly (Plathemis lydia) with 750 \m retroreflective 
markers on head (just above the red-tinted fovea of the eye) and 1,000 pm 
markers on body. b, 3D in-flight marker data, reconstructed from an 18-camera 
array (200 frames-per-second (fps), 150 jum root-mean-square (RMS) tracking 
accuracy in a ~2 m* volume; see Extended Data Figs 1 and 3). c, Dragonfly 
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Figure 1 | Parallel navigation and reactive control are insufficient to 
describe interception steering. a, The principle of parallel navigation is to hold 
the range vector at a fixed angle while reducing its length (schematic). b, Prey 
manoeuvring produces drift in the range-vector angle, which will be nullified by 
a pursuer using parallel navigation thereby establishing a new interception 
course (schematic). c, Three-dimensional (3D) flight path of dragonfly 
pursuing a fruitfly (50 ms steps). Stars indicate large steering events. 

d, Acceleration magnitude for the data shown in c. Stars indicate large steering 
events (see Methods). e, Distribution of latencies between prey (t = 0) and 
dragonfly steering events (155 prey events, 18 dragonflies), normalized to 
occurrence rates expected for chance levels. Dragonfly steering events 
exceeding the 2 s.d. margins occurred 47 + 13 ms after prey steering (range 30- 
70 ms), accounting for 31% of prey events. f, Range vector correlation for flights 
against real prey, for 110 successful captures (correlation = —0.99 + 0.02 at 
capture) and 30 unsuccessful flights (correlation = -0.68 + 0.31 at closest point 
to prey). g, Range vector correlation for constant-speed artificial prey 

(26 flights, 8 dragonflies, correlation = -0.98 + 0.04 just before capture). 


independent (see Methods). The difference between these distributions 
(Fig. le) revealed a significant excess of events with a mean lag of 47 + 
13 ms (+s.d., used throughout), providing an estimate of the reactive 
visuomotor latency through the dragonfly nervous system. However, 
approximately 70% of the prey steering events were not followed by a 
detectable dragonfly response, suggesting that in these cases the dra- 
gonfly response was either disproportionately small or occurred with a 
variable latency. Likewise, approximately 75% of the dragonfly steering 
events were not associated with a prey steering event. These observa- 
tions directly contradict purely reactive control strategies, which predict 
a one-to-one mapping between the timing of prey and pursuer steering. 


Body orientation directs steering 


We next investigated the purpose of the dragonfly steering events that 
were not directly related to prey angular velocity. While steering can also 
be based on other aspects of prey motion, such as angular position’”"*, 
it need not be based exclusively on prey motion. For example, the drag- 
onfly’s body orientation might influence its manoeuvrability and thereby 
the structure of its flight path. Likewise, the orientation of the high- 
acuity zone of its eye might affect the detection and response to prey 
manoeuvres”. Accommodating these constraints would require that the 
dragonfly relies on internal models of its head and body, which could 
be used reactively or predictively. To determine whether such models 
guide steering, we designed microscopic retroreflective markers that 
could be attached to a dragonfly’s head and body and tracked during 
flight. This enabled us to reconstruct the time-varying three-dimen- 
sional head and body orientation of a foraging dragonfly, along with 
its position and that of its prey (see Methods, Fig. 2a-d and Extended 
Data Fig. 3). 
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body axis, direction of gaze (foveation) and direction to prey (range vector). 
Black contour indicates foveal region of the eye. d, Interception flight path with 
head and body orientations (50 ms steps); prey remains foveated throughout 
1,000° s_* turn. e, Interception flight to variable-speed artificial prey. Black 
circles indicate the trajectory of the prey had the speed jump not occurred. 
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To evaluate whether dragonfly body orientation constrained the flight 
path, we analysed how this orientation evolved over the course of the 
flight. Constant-speed prey allowed us to identify changes in body 
orientation that were unrelated to prey manoeuvres. We found that 
systematic rotations of the dragonfly’s body often made the flight path 
significantly different from the straight-line path that would have mini- 
mized the distance to the same interception point (Fig. 3). During inter- 
ception flights, dragonflies maintained their azimuthal bearing aligned 
to their body axis (—5° + 29°). As each dragonfly perched ina random 
orientation relative to the prey’s flight path, to fly towards the intercep- 
tion point typically required a turn. These turns aligned the dragonfly’s 
body axis to the prey’s direction of motion in azimuth, while stabilizing 
the elevation of the body axis at about 30° (Fig. 3a—c). Such a stereo- 
typed alignment across flights suggests a specific body orientation is 
preferred for grasping the prey. As these manoeuvres occurred, the drag- 
onfly maintained the prey’s angular position to within 25° of directly 
overhead (Fig. 3d and Supplementary Video 5). The interception tra- 
jectory thus positioned and aligned the dragonfly directly below the prey 
and thereby reduced the capture problem to one of closing the vertical 
gap with the body in a constrained orientation. 

When dragonflies were confronted with prey that made a 20-80% 
speed jump mid-flight, their steering manoeuvres were virtually indis- 
tinguishable from those to constant-speed prey. This suggests that visual 
responses to prey manoeuvres were small corrections integrated into the 
ongoing steering program focused on body alignment and prey angu- 
lar position (Figs 2e and 3 and Supplementary Video 6). 


Head orientation tracks prey position 


Dragonfly body rotations induce substantial apparent prey motion that 
complicates detection of the prey’s angular position. This problem is 
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Figure 3 | Body dynamics constrain the interception steering strategy. 

a, b, Top (xy) and side view (xz) of flight trajectories to artificial prey, including 
dragonfly body axis orientation (40 flights, 26 constant-speed, 14 variable- 
speed trials, 9 dragonflies; x = y = z = 0 is the trial alignment point, 50 ms 
before capture). The body axis makes a5 + 17° angle in azimuth anda 22 + 18° 
angle in elevation with the constant-speed prey flight path just prior to 
capture (—6 + 27° azimuth and 36 + 16° elevation, variable-speed prey). 
Note that at take-off the body starts in a random orientation. c, Time evolution 
of the azimuthal angle between the dragonfly’s body axis and prey’s flight 
path, for the population data shown in a. Vertical grey line is the alignment 
point used in a, b. d, Spherical projection of prey flight paths relative to 
dragonfly position over the interception flight. North pole represents the 
zenith; black line the 25° contour. 
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amplified because the distance from dragonfly to prey scales the prey- 
image drift from prey manoeuvres, but has no influence on that from 
dragonfly body rotations. Thus, if the dragonfly rotates its body at 
1,000° s-' (for example, Fig. 2d and Supplementary Video 5) and the 
prey makes an abrupt turnat 1 ms’ ', the angular shift in prey position 
from dragonfly motion can be 10X larger than that from the prey itself. 
Discriminating prey-image drift that arises from prey motion rather than 
dragonfly motion is the cost of not using a steering strategy, such as par- 
allel navigation, that holds prey position fixed relative to a rigidly aligned 
head and body. This problem can be solved through signal processing 
within the nervous system to cancel apparent motion, or through move- 
ments of the head to nullify such motion before detection by neurons. 
We sought to determine which solution was used and to what extent it 
relied on prediction versus reaction. 

To determine how the dragonfly solves the drift discrimination pro- 
blem, we used our motion-capture system to estimate prey-image drift 
on the dragonfly’s eye during interception flights. We first measured the 
orientation of the dragonfly’s head and then derived the angular posi- 
tion of the prey image on the dragonfly’s compound eye (which is rigidly 
fixed to the head). Separately, we quantified the three underlying sources 
of prey-image drift: rotation of the dragonfly’s body; relative translation 
of the dragonfly and prey; and rotation of the dragonfly’s head relative 
to the body (Fig. 4 and Extended Data Fig. 4). We found that the prey 
image was held steady on the dorsal surface of the head (typically within 
4° azimuth and 7° elevation) in the approximate location of the high- 
resolution optical fovea (Figs 2a and 4d)'*. We refer to the process of 
holding the image stable on the head as ‘foveation’. The prey-image drift 
on the head was on average less than 50% that from body rotations, 
indicating significant cancellation. Indeed, head rotations were a linear 
function of the angular velocity induced by body rotations and relative 
translation, with a slope close to — 1 over a range of +1,000° s ~ 7 (Fig. 5a, 
b, d, e and Supplementary Videos 7 and 8). These data confirm that 
apparent motion disturbances are nullified through head movements 
before they enter the nervous system. 


Predictive foveation 


The compensatory head rotations could be driven reactively, based on 
sensory feedback from prey-image drift, or predictively, based on in- 
ternal models of drift. To discriminate between these two possibilities, 
we examined the delay between the summed foveation disturbances 
(Fig. 4a, b) and the corrective head rotations (Fig. 4c). Insect muscle 
contractions require approximately 5 ms to produce force”’, and any 
visually driven reactive movements of the head should display an even 
larger sensorimotor lag (for example, 47 ms from vision to wing, Fig. le). 
In contrast, if the dragonfly predicted foveal drift, the head could be 
moved with zero delay for optimal cancellation, especially in the constant- 
speed prey condition in which there is no unexpected manoeuvring. 
We found that head movements occurred with an average lag of 4 + 
4 ms after the disturbance they cancelled (Fig. 5c). Such an exceedingly 
brief delay is strong evidence for the head being controlled predictively 
rather than reactively. This prediction of prey-image motion was spe- 
cific to the goal of foveation. Head rotations nullified body rotations 
that created prey-image drift in azimuth and elevation, but not body 
rotations about the range vector axis, which has no influence on fovea- 
tion (Fig 5]; compare Fig. 5d, e vs Fig. 5f). The restriction of cancellation 
to the subspace of behavioural axes relevant for foveation is consistent 
with well-known optimality principles for sensorimotor control’’”’. 
To predict prey-image drift accurately, the dragonfly should model 
both of its individual sources: drift from body rotations (Fig. 4a) and 
drift from relative prey translation (Fig. 4b). Because the sum of these dis- 
turbances is dominated by drift induced by body rotations (+1,000° s 
range, Fig. 5d,e versus +200°s ' range for relative prey translation, 
Fig. 5j, k), the compensatory head rotations clearly contained a negative 
copy of the body rotation signal. To assess the strength of cancellation of 
translational disturbances, we calculated the residual head rotation sig- 
nal; that is, the sum of the head rotation and body rotation prey-image 
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Figure 4 | Decomposition of prey-image drift into its sources reveals 
cancellation. Dragonfly head-centred spherical projections of artificial prey 
trajectories during interception (37 flights, 21 constant-speed prey, 16 variable- 
speed prey, 5 dragonflies). The intersection of the dashed black lines shows 
the anatomical mid-line, in longitude, and the 38° latitude (horizontal); this 
corresponds approximately to the centre of the optical fovea (see Fig. 2c). 
Drift magnitude is reported in root-mean-square azimuth (RMS,,im) and 
elevation (RMS.,.,) for constant-speed and variable-speed prey across trials. 
a, Prey-image drift from rotations of the dragonfly’s body: 17.1° RMS,7im and 
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Figure 5 | Head movements predictively compensate for disturbances to 
foveation. (5 dragonflies, 21 constant-speed artificial prey flights). a, b, Time 
evolution of the prey-image drift rate from summed foveation disturbances 
(Fig. 4a, b), dragonfly head rotation (Fig. 4c) and actual drift on the head 
(Fig. 4d) for the blue trial in Fig. 4. c, Cross correlation between summed 
foveation disturbances and head rotations (mean lag across trials, 4 + 4 ms). 
d, e, Scatter plots of summed foveation disturbance and head rotation drift rate 
(all trials). Linear fits are compared to the ideal cancellation indicated by a slope 
of —1. Azimuthal disturbances are cancelled by 70% (slope —0.84 + 0.01) 
versus 30% for elevational ones (slope — 0.61 + 0.03). f, Scatter plot of body 


336 | NATURE | VOL 517 | 15 JANUARY 2015 


Head rotation d 


Prey-image drift on head 


— Constant-speed prey 
all trials 
Variable-speed prey 
all trials 

— Constant-speed prey 
(median trial) 

— Variable-speed prey 
(median trial) 

— Constant-speed prey 
(best trial) 


Ly 


12.8° RMS.i.,, for constant-speed prey (17.3° RMS,,im and 12.6° RMS, je), for 
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prey (14.6° RMS,,im and 8.0° RMS.i.,, for variable-speed prey). d, Prey-image 
drift on the head and eye, equal to the sum of a-c: 3.8° RMS,,im and 7.0° 
RMS.i-y; for constant-speed prey (3.4° RMS.yim and 7.8° RMS, iey, for variable- 
speed prey). See Extended Data Fig. 4. 
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rotation versus head rotation around the range vector shows little cancellation 
(slope —0.16 + 0.02). g-k, Time evolution (g, h), cross correlation (i) and 
scatter plots (j, k) of the prey-image drift rate from relative prey translation 
versus residual head rotations. i, The mean lag across trials of the residual 
head rotations is —3 + 25 ms. The cancellation of relative prey translation is 
not as good as that for body rotations. However, slopes of linear fits are still 
close to the optima of —1 (j, slope —1.17 + 0.05 azimuth; k, slope — 1.05 + 0.08 
elevation). 1, Axes of rotation producing azimuthal and elevational prey drift 
on the head, and the range vector axis about which rotations produce no prey 
drift (see Extended Data Fig. 4). 
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drift rates. This signal was free of the large body rotation prediction and 
could therefore be examined for its correlation to the smaller trans- 
lational drift. We found that the prey-image angular velocity arising from 
the residual head rotations was on average equal and opposite to that 
of the translational drift rate (Fig. 5g, h, j, k and Extended Data Fig. 5, 
and Supplementary Video 9), indicating that the residual head rotations 
clearly contained either a predictive or a reactive response to transla- 
tional drift. Residual head rotations occurred on average 3 + 25 ms be- 
fore the translational disturbance they were cancelling (Fig. 5i), far earlier 
than the visual delay expected for reactive steering (Fig. le). These obser- 
vations argue that the compensatory head movements for translation 
of constant-speed prey must also be predictive. During the interception 
of variable-speed prey, foveation was still maintained (Fig. 4d), demon- 
strating that the dragonfly’s head rotated reactively when unexpected 
prey motion occurred, consistent with earlier work!°. Head rotations 
thus seamlessly incorporated both predictive and reactive control, and 
closely approximated an ideal tracking system. 


Discussion 


It is a longstanding question® as to whether insects use internal models, 
such as those known in vertebrate sensorimotor tasks’. Traditional 
views have held that invertebrate behaviours, including prey intercep- 
tion”’, are based solely on reactive control. Even in mammals and birds, 
interception is often described with no predictive or model-driven com- 
ponents'*”’, Dragonfly steering in particular has been thought to be 
implemented by a neural autopilot" that uses prey angular velocity to 
steer the wings reactively and hold the prey image stable on the eye, 
thereby implementing parallel navigation’®. Here we have shown that, 
in striking disagreement with this classic model, the prey-image drift 
that would drive such an autopilot is minute and poorly correlated with 
prey angular velocity (Fig. 4d). Moreover, the dragonfly’s steering is 
inconsistent with both parallel navigation and all other reactive control 
models driven exclusively by prey movement (Fig. 1). 

We have found that the interception flight path is substantially in- 
fluenced by biomechanical constraints. The dragonfly’s steering strat- 
egy is to align its body and bearing to the prey’s direction of motion 
while remaining directly below the prey and closing the vertical distance 
to it (Fig. 3 and Supplementary Video 1). Such a body-centric strategy 
has numerous consequences. Aligning bearing to body axis should im- 
prove the dragonfly’s speed and manoeuvrability. Aligning them to the 
prey’s flight path elicits orienting turns early in the flight but ultimately 
positions the dragonfly such that the prey drops directly into its legs, 
reducing the need for drastic turns in the final moments of capture. 
Finally, because the dragonfly approaches the prey from below, it falls 
in the prey’s blind spot and is less likely to elicit an escape response. 

Central to the interception strategy is prey foveation, the dynamics 
of which imply direct model-driven control of the head. Predictive head 
rotations nullify prey-image drift due to both self-motion and constant- 
speed prey motion with nearly zero lag (Fig. 5 and Supplementary 
Videos 8 and 9), but only in the moving visual axes that affect foveation. 
This contrasts with circuits such as the vestibulo-ocular reflex, which 
reacts to sensory feedback with a delay and counter-rotates the eyes to 
cancel self-motion in a fixed coordinate system*. Our data are consis- 
tent with the dragonfly nervous system using an efference copy’® of wing 
motor commands to drive a forward model®® that predicts the prey- 
image drift from self-motion. The efference copy alone” is insufficient 
for cancellation because steering commands do not have a simple rela- 
tion to either wing motion” or prey-image movement. Likewise, at least 
a simple model of constant-speed prey motion is needed to predict prey 
translation. An inverse model would then generate the motor com- 
mands to produce the exact head rotation that will cancel these expected 
disturbances. 

Prey foveation also imposes indirect model-driven control over body 
steering. Because foveation is so effective, there is virtually no prey-image 
motion on the eye for most of the flight (Fig. 4d and Supplementary 
Video 8). Consequently, prey angular position and velocity cannot be 
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measured solely from vision. Instead, foveation encodes prey angular 
position in the dragonfly’s head—body angles. Prey angular velocity is 
probably computed at take-off, and then estimated from memory by 
incrementing its current value with any new visual drift. The key pa- 
rameters for steering thus appear to arise primarily from prediction and 
estimation processes. Neurons that detect prey-image drift'’’” would 
be well suited to reactively correct the steering controller during epi- 
sodes of brief unexpected prey motion, rather than steer the dragonfly 
continuously. In this view, if the dragonfly’s prediction of prey posi- 
tion were perfect, the prey image would be stabilized to a point for the 
entire flight and vision would be dispensable, at least for the purposes 
of interception. A control architecture consistent with these results is 
suggested in Extended Data Fig. 2b. 

We have shown that in solving sophisticated motor control problems, 
insects rely on both predictive and reactive control, combining for- 
ward, inverse and target models with visual feedback. Such internal 
models, while not previously described in insects*”*, are analogous to 
those used in reaching tasks in vertebrates’. How such control processes 
are implemented is a question that cannot be answered solely from be- 
haviour. For example, while foveation implicitly uses both forward and 
inverse models, it is unclear whether the dragonfly nervous system uses 
two distinct representations or a single one with the same function’®. 
While neural correlates of internal models have been described in pri- 
mates’, their underlying representations are highly distributed and circuit- 
level understanding remains elusive. Smaller insects, such as Drosophila”, 
have not yet been shown to use such control strategies, and their size 
would make the requisite free-flight measurements challenging. In con- 
trast, dragonflies have stereotyped and accessible neural circuitry, and 
are large enough to permit free-flight measurements of position, body 
state and neural data”. The combination of these elements is rare, and 
the dragonfly may thus be a system where internal models can be dis- 
sected behaviourally and then mechanistically reconstructed from anal- 
ysis of the neural circuit dynamics. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Animal husbandry. Experiments were conducted with the dragonfly Plathemis lydia 
(males and females, without distinction). Nymphs were collected from ponds near 
Janelia Research Campus, reared in an indoor aquatic facility, and fed brine shrimp 
and black worms. After emergence, dragonflies were inserted in a flight arena re- 
producing naturalistic conditions (see below), where they foraged on free-flying 
Drosophila virilis. Roughly 90% of the dragonflies foraged regularly and steadily 
gained weight, often reaching weights >400 mg within 2 weeks (from an average of 
220 mg at emergence). For kinematics experiments, we selected animals weighing 
>300 mg that could easily carry the approximately 15 mg mass comprising retro- 
reflectors and frame (see below). 

Flight arena. We designed an indoor dragonfly flight arena with naturalistic visual 
cues and controlled light, heating and humidity (Extended Data Fig. 1). Dragonflies 
were able to live and forage successfully for weeks in this facility. The dimensions of 
the room were 5.5 m X 4.3 m X 4.6 m (length X width x height). Naturalistic scenery 
on the walls provided optic flow cues needed by dragonflies for stable flight. An array 
of 36 high-intensity-discharge (HID) lights was used to illuminate the room. 350 W 
bulbs (Philips CDM-TMW) were housed in pairs driven by a 220 Hz ballast. Flicker 
amplitude of the lighting system was negligible. The room was supplemented with 
additional UV lighting (Sylvania 350 Blacklight). The HID array produced con- 
siderable heat, both from the bulb fixtures as well as the emitted light. To allow for 
precise temperature control in the flight arena, the lights were segregated outside 
the arena in a box that enclosed the perimeter of the room at a height of 3.4 m. The 
inner walls of this sealed box were made of transparent acrylic. HID lights were 
aimed through the acrylic so they pointed upwards at roughly 45°, towards the tex- 
tureless white ceiling of the flight arena. Typical illumination levels in the flight 
arena were approximately 10 mW cm”. Because the light box was sealed, each of 
its four walls could be coupled to an air-conditioning unit that cooled the ballasts 
and fixtures without creating any wind within the flight area. Finally, chilled water 
pipes concealed within the ceiling of the flight arena provided passive heating/cooling 
of the interior space, allowing temperature and humidity control (31 °C, 55%), com- 
parable to summer conditions. A single foraging platform (0.6 X 0.6 m, raised 0.3 m 
off the floor) was placed in the centre of the flight arena. Insect prey was attracted 
to the platform by placing dishes of fly food nearby. The prey in turn attracted 
dragonflies to the platform to forage. In a second set of experiments, we installed a 
custom-made artificial prey delivery system above the platform and a nylon net- 
ting covering the feeding area (1.5 X 1.5 X 2m). The netting kept the experimental 
dragonflies (with retroreflective markers) in the foraging area. 

Centroid tracking system. Two Photron SA1 cameras were mounted at a height 
of 3.4m on orthogonal walls, and were aimed at the perching platform. This geo- 
metry provided a field-of-view of approximately 1m’. Flights were filmed at 
1,000 frames-per-second (fps), with a 1/2,000 s shutter speed. Identification of drag- 
onflies and prey in the videos was made straightforward by the high contrast against 
the grey acrylic background provided by the platform. The software DLTdv3 (ref. 30) 
was used to calibrate the cameras (via the positions of nine static markers mounted 
on the perching platform), track the centroids of dragonfly head and prey body 
within each camera’s view, and, from this data, reconstruct the time-varying three- 
dimensional (3D) position of each participant. The dragonfly head centroid was 
used, rather than its body, as it was less occluded by wings, etc., and hence could be 
tracked more reliably. Camera residuals were approximately 150 jm at the height 
of the perching platform, confirmed by tracking beads moved on a calibrated mi- 
cromanipulator. Tracking uncertainty was less than 500 kum at all locations in the 
volume. 

Micro retroreflective markers. We used motion-capture techniques and novel re- 
flective markers to measure the full 3D head and body kinematics in foraging drag- 
onflies. At the core of the system are custom-made retroreflective markers (750 um 
and 1,000 j1m diameter) that are light enough to be carried by a flying dragonfly 
(1.2 mg for the 750 um marker) but reflective enough to be visualized and tracked 
by acamera array at a distance of up to 4 m. The retroreflectors are ball lenses (Knight 
Optical, Lasfn-35), with a refractive index of 2.0—optimal for a spherical retro- 
reflector. The ball lenses were coated with an antireflective layer matched to the 
790 nm spectrum of the motion-capture strobes. Moulds were fabricated to isolate 
one hemisphere of the ball lenses, allowing it to be mirrored with silvering solution 
(Angel Gilding, Oak Park, IL). The final retroreflector was a refractor-reflector pair, 
with the refractive outer surface focusing light onto the mirrored back surface, where 
it would reflect back to the outer lens and refract outwards on the original line-of- 
sight path of the light source. Retroreflectors were attached to the dragonfly using 
UV cured superglue (Loctite 4305), either as isolated 750 jum markers on each side 
of the head, or as a 3-marker (1,000 jm each) rigid carbon frame on the body (the 
body-marker frame, mass 7 mg; Fig. 2a and Extended Data Fig. 3a, b). The rigid frame 
fixed the 3-marker geometry as a right-angled triangle, and could be easily tracked 
by the cameras. It also allowed a frame-by-frame 3D reconstruction test of calibration 
accuracy (see below). The spherical retroreflectors had 100°-150° of functional 
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coverage, and great effort was made to ensure that the attached marker orientations 
were optimal for camera visibility during typical body and head rotations made by 
a flying dragonfly. The attachment of the body marker frame did not appear to affect 
foraging. The effect of head marker attachment varied across individuals. Some 
dragonflies did not forage, and displayed excessive grooming. Other dragonflies 
sporadically attempted foraging but were unsuccessful. These failures probably re- 
flected either a mechanical disturbance from the markers (for example, an imbal- 
ance in their centre of mass), or a disturbance to sensory hairs. For roughly half the 
animals, there was no detectable effect on flight performance and the dragonflies 
continued to capture rapidly moving prey successfully (compare Fig. 1c to Fig. 2d). 
It is possible that placement of markers on the head created a slight imbalance, and 
hence the quality of foveation and disturbance cancellation might be even better in 
a dragonfly without head markers. 

Motion-capture system. An 18-camera commercial motion-capture system running 
at 200 Hz (shutter: 1/3,000 s) was customized to track the retroreflective markers 
(Raptor-4; Motion Analysis Corp). The original strobes on cameras were replaced 
with 250 790-nm LEDs. Infrared (IR) notch filters (Chroma, part et790/30_52CR) 
were used to ensure that all the light the cameras received was driven by the IR 
strobes and not the visible HID lighting used to illuminate the flight arena. Com- 
mercial motion-capture software (Cortex, Motion Analysis Corp.) and custom soft- 
ware written in MATLAB (MathWorks, Inc.) were used to identify the 3D marker 
data based on the known distances between the body markers (6 mm, 8 mm, 10 mm) 
and, separately, between the two head markers (roughly 5 mm). Camera residuals 
were typically 150-200 um over a 1m X 1m X 2m volume. Usinga calibrated mi- 
crometer, we found displacements as small as 50 jm could be reliably tracked for 
stationary beads. The camera calibration drifted over time, and we recalibrated the 
system as often as needed to maintain the average tracking residual of less than 
200 um (for the 1,000 jzm markers). The rigid body-marker frame allowed us to 
assess the tracking accuracy during flight by analysing the jitter in the measured 
length of the carbon fibre rods; this was found to be 100-200 kum, consistent with 
the camera residuals. Because the cameras track the marker centroids in hardware, 
the data load of the system is small. This permits long flight durations to be re- 
corded, as well as the possibility of closed-loop control. 

Artificial prey delivery system. To elicit foraging flights with controlled prey 
statistics, we designed a system to deliver artificial prey to the perched dragonflies 
(Extended Data Fig. 1). A 2 mm retroreflective bead on transparent fishing line 
was actuated by a computer-controlled, height-adjustable pulley system. The bead 
was moved at either constant speed (0.2-1.5ms_') or with one or more pre- 
programmed speed variations mid-trajectory (amplitude 20-80% of the initial speed); 
these metric speeds, and the bead size, were based on real prey data. In all cases the 
artificial prey’s direction of motion was constant. For analysis purposes, we clas- 
sified as constant-speed prey the trials in which the maximum deviation of prey 
speed from its mean value was <10% (in the period following dragonfly take-off). 
All the other trials were classified as variable-speed prey. The 3D position of the 
bead was tracked with the motion-capture system described above. Dragonflies 
took off after approximately 10% of the artificial prey we presented, and approxi- 
mately 50% of these pursuits led to successful capture. The low take-off rates may 
be due to the strongly linear motion of the artificial prey which rarely exists in 
nature. For the purpose of this study, such prey motion was needed to rule out the 
parallel navigation model and to analyse foveation quality for fully predictable 
prey motion. 

Data processing and analysis. All data processing and analysis was performed in 
MATLAB (Mathworks), unless otherwise specified. A sufficient number of dra- 
gonflies and flights were collected for each condition such that all results could be 
reproduced robustly. No statistical method was used to predetermine sample size. 
Pre-processing and analysis of real prey trials (centroid tracking). Three- 
dimensional trajectories of dragonfly (head centroid) and Drosophila (body centroid) 
were reconstructed as described above with DLT dv3 (ref. 30). Missing samples (for 
example, due to visual obstruction) were patched taking the corresponding sam- 
ples from a cubic spline fit of the overall trajectory (gaps >45 ms were not used). 
Trajectories were then filtered with a finite impulse response (FIR) low-pass filter 
with bandwidth of approximately 20 Hz (order 160, cutoff frequency 10 Hz) and 
corrected for the linear phase shift. This choice of filter removes not only high- 
frequency noise but also the small effects of individual wing beats on the traject- 
ories, which are not addressed in this study (prey motion relative to the dragonfly 
does not occur in this bandwidth and hence could be filtered out by the dragonfly); 
the absence of phase distortion guarantees that the manoeuvring timing, which is 
a main focus of this research, is not affected. To minimize edge effects at the moment 
of capture (since the prey disappears here and the time series ends), the raw data 
were extrapolated (cubic spline) by the half-width of the FIR filter. Numerical de- 
rivatives were computed using a Savitsky—Golay filter of length 11 and order 4, and 
corrected for linear phase shift. 
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We defined the dragonfly take-off time as the first sample at which the vertical 
speed of the dragonfly increased above 0.15 ms’. Capture time was defined as the 
last sample at which the prey position was available, for successful trials, or the sam- 
ple of minimal distance, for unsuccessful trials. 140 real prey flights (110 successful 
captures, 30 close misses with minimal distance <40 mm) from 18 dragonflies were 
used in the analysis; the median flight duration in these data was 310 ms (maximum 
~1,000 ms). In the analysis of range vector correlation (Fig. 1f, g), we excluded 
times preceding dragonfly take-off, when changes in the range vector only depend 
on the prey trajectory. 

To analyse the timing relation between prey and dragonfly steering in success- 
ful trials, we identified as significant events the peaks in acceleration magnitude 
that were larger than the typical acceleration, given by the median across all trials 
(5.46ms * for prey, 10.86 ms * for dragonfly). We excluded dragonfly accelera- 
tion peaks within the first 25 ms after take-off, that arise systematically due to the 
lift-off manoeuvre, as well as those occurring within the last 20 ms before capture, 
as they reflected the prey grasping, non-interception portion of the flight (Sup- 
plementary Video 1). We identified 155 prey steering events and 190 dragonfly 
steering events across the 110 successful trials. 

To identify how many of the dragonfly steering events were likely to be direct 

responses to prey steering events, we analysed the lag distribution between pairs of 
prey-steering and dragonfly-steering events occurring within the same trial (n = 
332 event pairs). We also computed the control distribution expected if the timing 
of prey and dragonfly steering events were independent of each other. The control 
distribution was estimated by replacing the prey steering events within each trial 
with an equal number of events uniformly distributed in time, and averaging over 
1,000 iterations. The control distribution was then subtracted from the empirical 
lag distribution. The results of the steering timing analysis (Fig. le) were robust 
to changes in the criteria used in selecting the steering events (that is, minimum 
threshold for acceleration peaks). 
Pre-processing of retroreflective marker trajectories (motion capture). The iden- 
tified 3D marker trajectories were obtained as described above using Cortex (Motion 
Analysis Corp.) and MATLAB (Mathworks). Missing samples (from untracked 
markers, for example, due to occlusion) were patched via a cubic spline fit of the 
overall trajectory, excluding gaps >25 ms. Trajectories were then filtered with an 
FIR low-pass filter with bandwidth of about 20 Hz (order 32, cutoff frequency 
10 Hz, comparable to the filter used earlier for centroid tracking) and corrected for 
the linear phase shift. To minimize edge effects at the moment of capture (since the 
prey disappears here and the time series ends), the raw data were extrapolated (cubic 
spline) by the half-width of the FIR filter. Numerical derivatives were computed 
using a central difference filter of order 9 and also corrected for the linear phase shift. 
Identification of head joint location. The connection between the dragonfly’s 
head and its thorax occurs over a very small, point-like area*', which we refer to as 
the head joint. To reduce the number of markers carried by the dragonfly’s head, 
we numerically estimated the location of the neck joint from the head and body 
markers. We estimated its position by adapting a technique developed to determine 
the centre of rotation of a ball joint”. The method numerically fit the joint as the 
centre of a collection of concentric spheres, each of which describes a sample rota- 
tion. To evaluate its performance, we applied the method to a rotating pin, and 
recovered the point of rotation with approximately 150 jum accuracy. 

To apply the method to dragonflies, we used data from the saccadic head move- 
ments made by each dragonfly before engaging in prey pursuit (Supplementary 
Video 3). These head saccades are rigid rotations of the head about the joint while 
the body remains stationary. For each dragonfly with head markers, at least 48 head 
saccade trajectories were used to infer the head joint location relative to the body 
markers. In each interception trial, the frame-by-frame position of the head joint 
was then reconstructed from that of the body markers via this transformation. One 
dragonfly was excluded from the analysis of foveation as it had insufficient head 
saccade data to accurately compute the neck joint. 

Head and body kinematics. To allow cross-animal comparison, we mapped the 
body and head reference frames to stereotypical anatomical features on each drag- 
onfly (Extended Data Fig. 3). This mapping was performed immediately after the 
retroreflective markers were attached, using a rig comprised of two colour HD cam- 
eras, or a single camera paired with a mirror. From the two close-range images of 
the dragonfly, we reconstructed the 3D relationships between markers and dra- 
gonfly anatomical features using methods described previously*’. The marker head 
frame was rotated into a fixed foveal head frame such that the x axis (roll) pointed 
towards the foveal centre (mean elevation of prey on head across all trials) used by 


each dragonfly. The body marker frame was pitched downwards into a body frame 
such that the x axis (roll) corresponded to the longitudinal axis of the dragonfly’s 
body (Fig. 2c). See Extended Data Fig. 3c for details. Euler angles representing head 
or body orientations (for example, Fig. 3), were defined based on a z-y-x Tait-Bryan 
convention; the first two angles correspond to the azimuth and elevation of the 
longitudinal body axis (xg) with respect to the world reference frame. Angular veloc- 
ities (Fig. 5 and Extended Data Fig. 5) were computed from numerical differentiation 
of the quaternion representations of such orientations. 

Analysis of artificial prey trials. Data for the body dynamics steering analysis 
(Figs 1g and 3) were time-aligned to capture and required only body markers. 
This analysis included all the trials in which the dragonfly and prey trajectories could 
be tracked at least until their minimum distance was reduced to below 40 mm. This 
threshold was typically reached approximately 30 ms before capture; the marker 
samples after this time window were sometimes lost due to occlusions when the 
body rotated to grasp the prey. This data set comprised 40 trials (26 constant-speed 
prey, 14 variable-speed prey) from 9 dragonflies. The dragonfly trajectories were 
computed at the head joint, which represents the origin of the head reference frame 
relevant to foveation, and allows comparison with the head centroid trajectories 
computed in the Photron data set (Fig. 1). Head joint trajectories were computed 
from those of the body markers as described above, using animal-specific transfor- 
mations when available (5 head-marked dragonflies) or average transformations 
otherwise (4 dragonflies without head markers). Take-off and capture times were 
defined as in real prey trials. 

The prey foveation and head-control analysis (Figs 4 and 5) required both head 
and body marker data. The foveation data set comprised 37 flights (21 constant- 
speed prey, 16 variable-speed prey, from 5 dragonflies). The data were time-aligned 
to take-off and used 50 ms of data before take-off as foveation was already estab- 
lished and compensating for preparatory body movements. The 50 ms immediately 
before capture were omitted from analysis as the large body rotation initiated in this 
time interval (to grasp the prey) shifts the prey to outside the field-of-view of the 
dorsal eye. 

In Fig. 4d we showed the actual prey trajectories as seen in the anatomical head 
frame, assuming the prey is a point. This is justified by the fact that at an average 
distance of 400 mm, a 2 mm prey subtends 0.3°—slightly smaller than the approx- 
imate angular size of a single ommatidium in the high-resolution portion of the 
eye’’. In Fig. 4a-c we showed the trajectories that would have been produced by 
each drift source alone: rotation of the body, translation of the prey relative to drag- 
onfly, and rotation of the head relative to the body. Each trajectory was computed 
by numerically integrating (Simpson’s rule) the individual terms in the drift rate 
equations (Extended Data Fig. 4), given the prey location and our measurements of 
head rotation, body rotation, etc. Numerical integration of the sum of the three drift 
terms produced trajectories indistinguishable from those obtained by direct mea- 
surements (Fig. 4d). The root mean square (RMS) estimates of prey jitter were 
computed by combining all the trials, after removing the trial-specific mean loca- 
tion of the prey during interception. 

In Fig. 5, the quantities plotted are those in the equations of Extended Data Fig. 4, 
except that the azimuthal terms were multiplied by the cosine of the elevation (a 
natural rescaling that makes azimuthal and elevational rates comparable to each 
other). To compute the residual head rotations that compensate for translational 
prey-image drift, we added the body rotation to the head rotation. This removes 
any body rotation prediction from the head rotation signal and leaves a combina- 
tion of prediction error, prey translation compensation, and other sources of noise 
(see Fig. 5g-k). The multi-dimensional cross-correlation functions showed in 
Fig. 5 were computed using a vector comprised of azimuth and elevation (but not 
the range direction), on a trial-by-trial basis. Margin of errors (+1 standard devia- 
tions) on the slopes of the linear fits were computed from bootstrapping (10,000 
iterations). The percentages of foveation disturbance reduction (Fig. 5) were com- 
puted from the median, over all trials, of the ratio between prey-image drift rate 
magnitude before and after cancellation (body rotation + relative translation ver- 
sus body rotation + relative translation + head rotation). 


30. Hedrick, T. L. Software techniques for two- and three-dimensional kinematic 
measurements of biological and biomimetic systems. Bioinspir. Biomim. 3, 
034001 (2008). 

31.  Gorb,S. N. Evolution of the dragonfly head-arresting system. Proc. R. Soc. B 266, 
525-535 (1999). 

32. Chang,L. Y. & Pollard, N. S. Constrained least-squares optimization for robust 
estimation of center of rotation. J. Biomech. 40, 1392-1400 (2007). 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


Extended Data Figure 1 | Indoor flight arena enables recording of 
dragonfly foraging behaviour. Flight arena is a 5.5m X 4.3m X 4.6m room 
with controlled lighting (10 mW cm *, 220 Hz illumination), heating (31 °C) 
and humidity (55%), comparable to outdoor summer conditions. Naturalistic 
images on the walls provide dragonflies with the optic flow needed for flight 
stability. a-c, A raised platform (0.6m X 0.6 m) (a), used by the dragonflies 
as a foraging perch, is placed at the centre of the room, within the filming 
volume of two high-speed cameras (b, Photron SA1, 1,000 fps, 150 jum camera 


residuals) and 18 infrared motion-capture cameras (c, Motion Analysis Corp. 
Raptor-4, 200 fps, 150 um camera residuals). Insect prey (Drosophila virilis) 
are stocked in the arena and attracted to the platform by fly food dishes 

(not shown). d, An artificial prey delivery system is installed above the 
platform; this comprises a computer-controlled, height-adjustable pulley 
system (dashed yellow line) which moves a 2-mm retroreflective bead on 
transparent fishing line. A netted enclosure (1.5m X 1.5m X 2m) keeps the 
instrumented dragonflies in the filming area during experiments. 
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Extended Data Figure 2 | Classical reactive steering versus model-driven 
interception steering in the dragonfly. a, Classical reactive steering: prey- 
image drift directly drives steering commands to maintain a desired strategy, 
such as parallel navigation (Fig. 1a). Dragonfly motion and prey motion alter 
prey angular velocity and position, thereby driving the next round of steering. 
b, Conceptual model summarizing the proposed control architecture used 

by the dragonfly—a predictive steering pathway (red) and a visually driven 
reactive steering pathway (blue) are needed to explain our results. Steering of 
the body is driven by prey position, prey angular velocity and the orientation of 
the body relative to the zenith (Fig. 3). We conjecture that these variables (the 
current prey state) are compared to a desired prey state—prey overhead, with 
the dragonfly body axis aligned to the prey’s flight path—and the state error is 
used to produce a wing command that yields movement of the dragonfly’s 
body. An inverse model may be used to transform the desired dragonfly state 
(in sensory coordinates) into a motor command; future studies will be needed 


Prey image 


drift Prey motion 


Prey angular 
velocity 


INVERSE 
MODEL 


relative to head 


Body 
translation 


Wing 
commands 


Wing 
muscles 


Body rotation 


Head 
rotation 


Body rotation 


Relative 
prey 
translation 


Dragonfly translation 


Prey 
translation 


to confirm this. Head steering to stabilize the prey in the fovea (Fig. 4d) is driven 
by prey angular velocity and an efference copy of the wing command signal. 
These variables yield a forward-model prediction of expected foveal drift, which 
is passed through an inverse model to generate compensatory head rotations 
(Figs 4c and 5). Accurate foveation can be achieved in the absence of 
unexpected prey motion (for example, blue trial, Fig. 4d), eliminating prey- 
image drift on the head and hence activity of the visual reactive pathway. In this 
regime, only the predictive pathway drives dragonfly steering. The predicted 
prey position can be inferred from the head—body angles (encoded 
proprioceptively, or estimated via efference copy of neck commands), whereas 
prey angular velocity must be available as an internal state variable. When 
prey-image drift on the retina detects unexpected prey motion, the reactive 
pathway introduces corrective steering of both body and head (Fig. le) 

and updates the estimate of prey angular velocity. 
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Extended Data Figure 3 | Definition of anatomically based head and body 
reference frames. a, Position of the retroreflective markers is mapped to 
stereotypical anatomical features, which are used along with the numerically 
estimated head joint to define anatomical reference frames. b, Anatomical 
features include three points on the thorax (leftmost and rightmost tips of 
dorsal carina of episternum, painted in red for ease of identification, and centre 
of hind wing scotellum) and three points on the back of the head (upper tip of 
midline and two laterally symmetrical points, for example, the tips of yellow 
coloured bands underneath the head markers). c, A preliminary body frame is 
defined based on the triangle formed by the three anatomical points on the 
thorax, with its origin at the rear vertex of the triangle. From this vertex, the roll 
axis (xg) is defined as the direction to the midpoint between the other two 
vertices, the yaw axis (zg) as the direction orthogonal to the triangle, and the 
pitch axis (yg) as the direction orthogonal to the other two. A preliminary head 


frame, with origin at the head joint, is defined based on the head anatomical 
features. From the joint, the yaw axis (z) is the direction to the top anatomical 
point. The pitch axis (yj) is the direction pointing from the right to the 

left anatomical points on the head, orthogonalized relative to the yaw axis. 
Finally, the roll axis (x}) is orthogonal to the other two. In the schematic, the 
black solid contour on the head denotes its dorsal part, defined by this head 
frame. d, The final body frame is obtained applying a 25° pitch correction to 
the preliminary one, to compensate for the natural upward slope of the 
anatomical triangle. The corrected body axis roughly corresponds to the 
thorax—abdomen line (Fig. 2c). The final (‘foveal’) head frame is instead 
obtained by applying a pitch correction to the preliminary one so that the 
roll axis matches the average prey-image elevation during pursuit. This 
provides an estimate for the centre of the high-acuity region of the eye, and 
varies slightly across animals (38° average). 
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Extended Data Figure 4 | Analytical decomposition of prey-image drift on 
the head. The angular position of the prey image on the head is obtained by 
projecting the range vector (r) on the head reference frame (x4, Yi, Ze); WE 
define the azimuthal position as Oprey, azim and the elevational position as 
Oprey, elev. Prey-image drift (“drift’, with rates Oprey, azims prey, elev) can be induced 
by changes in the orientation of the head, and by changes in the direction of the 
range vector (caused by translation of the prey relative to the dragonfly). 
Changes in head orientation can be quantified as the sum of the absolute 
angular velocity of the body (@g, abs: body rotation’) and the relative angular 
velocity of the head with respect to the body (@y, rep ‘head rotation’). The 
contribution of relative translation to the drift rate equals the difference 
between dragonfly (vag) and prey (Vprey) velocities, scaled by the distance to the 


(dragonfly & prey) 


prey (|r|). The relevant dragonfly velocity for foveation is that at the origin of 
the head frame; for graphical clarity, we have shown the velocity at the origin of 
the body reference frame (xg, Yg; Zp) instead. The notation a? stands for the 
projection of vector a in the direction of vector b. Prey movement in the 
head frame (x4; Yes Zz) (Fig. 4) will generally produce motion simultaneously 
along all three axes. We analysed foveation within the coordinate system 
defined by (r, ¥, ¥ x r) (Fig. 5) because it decouples the directions functionally 
relevant for foveation. Rotations about v produce purely elevational drift, 
whereas rotations about v x r produce purely azimuthal drift. Translations 

in the range vector direction and rotations about that axis only change the 
angular size and orientation of the prey on the eye but cause no drift in position. 
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Extended Data Figure 5 | Dragonfly head movements compensate for both __b, elevation: m = —0.21 + 0.04, r° = 0.02) and prey translation alone 
dragonfly translation and prey translation. a-f, Scatter plots of residual head —_(c, azimuth: m = —0.60 + 0.04, 7° = 0.14; d, elevation: m = —0.02 + 0.04, 
rotation against individual prey and dragonfly translational components,  <0.01) is largely inferior to the cancellation of the relative prey translation 
and against their sum (21 constant-speed artificial prey flights). Cancellation of (e, azimuth: m = —1.17 + 0.05, r = 0.38; f, elevation: m = —1.05 + 0.08, 
dragonfly translation alone (a, azimuth: slope m = —0.38 + 0.07, r = 0.03; r= 0.14). 
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Impact jetting as the origin of chondrules 


Brandon C. Johnson!, David A. Minton, H. J. Melosh? & Maria T. Zuber! 


Chondrules are the millimetre-scale, previously molten, spherules 
found in most meteorites’. Before chondrules formed, large differ- 
entiating planetesimals had already accreted’. Volatile-rich olivine 
reveals that chondrules formed in extremely solid-rich environments, 
more like impact plumes than the solar nebula**. The unique chon- 
drules in CB chondrites probably formed in a vapour-melt plume 
produced by a hypervelocity impact® with an impact velocity greater 
than 10 kilometres per second. An acceptable formation model for 
the overwhelming majority of chondrules, however, has not been 
established. Here we report that impacts can produce enough chon- 
drules during the first five million years of planetary accretion to 
explain their observed abundance. Building on a previous study of 
impact jetting’, we simulate protoplanetary impacts, finding that 
material is melted and ejected at high speed when the impact velo- 
city exceeds 2.5 kilometres per second. Using a Monte Carlo accre- 
tion code, we estimate the location, timing, sizes, and velocities of 
chondrule-forming impacts. Ejecta size estimates* indicate that jetted 
melt will form millimetre-scale droplets. Our radiative transfer models 
show that these droplets experience the expected cooling rates of ten 
to a thousand kelvin per hour®"®. An impact origin for chondrules 
implies that meteorites are a byproduct of planet formation rather 
than leftover building material. 

Recent work shows that the isALE hydrocode’”” is capable of mod- 
elling the extreme process of impact jetting’. While previous models 
focused on impacts between strengthless fluid bodies’, here we determine 
the effect that porosity and material strength have on the jetting pro- 
cess (Methods). Using iSALE we simulate vertical impacts of initially 
fractured dunite impactors of 1%, 10%, and 25% porosity and 10-km 
diameter on flat targets at impact velocity Vimp = 1-6kms_' stepping up 
by 0.5kms_' (Methods). Later, we scale our results to larger impactor 
sizes using hydrodynamic similarity’. We find that no melted material 
(Methods) ejected above escape velocity is resolved for impacts with 
Vimp < 2.5 km s~ "assuming that Vimp/Vesc is between 0.5 and 2, where 
Vesc is the escape velocity. During the impact simulation shown in Fig. 1, 
a total mass of material equivalent to ~ 1% of the impactor’s mass is melted 
and ejected at higher than escape velocity for an assumed Vimp/Vesc = 1. 

As shown in Fig. 1, jetting melts and ejects near-surface material at 
high velocity. Thermal modelling and palaeomagnetism of chondrites 
indicate that some differentiated planetesimals had outer shells of undif- 
ferentiated material’. If this near-surface material is undifferentiated, 
jetting will produce chondrules with primitive compositions’». This is 
in contrast to other models that rely on splashing during low-velocity 
(Vimp ~ 10-100 m s— ) collisions between already molten planetesimals'*”. 
It is doubtful that ejection of previously molten and differentiated’ mate- 
rial would produce chemically unfractionated chondrules'*. Moreover, 
collisional splashing creates ‘droplets’ that are approximately 40 m in 
diameter (Methods). Previous size estimates'®, which agreed with observed 
chondrule sizes, neglected the effect of decompression heating”. 

Using the GAME Monte Carlo accretion code” we are able to deter- 
mine where and when chondrule-forming impacts will occur. We model 
a typical minimum mass solar nebula (MMSN) and a three times more 
massive nebula (3 MMSN), both of which extend from 0.4 astronomical 
units (AU) to 4 AU (ref. 20). Initially, solid bodies have a main-belt-like 


size frequency distribution between 100 km and 1,000 km in diameter”! 
(Methods). In addition to these bodies, our models include the eccen- 
tricity damping effect of nebular gas”. As Fig. 2 shows, chondrule-forming 
impacts occur about 10°-10* yr into the simulation. Assuming that the 
initial conditions of our model correspond to the time of CAI forma- 
tion tcay this timing of chondrule formation is consistent with the age 
of the oldest chondrules, toa; + 0.4 million years (Myr) (ref. 22). 
The target bodies for impacts with vimp < 2.5kms~ ‘ are planetary 
embryos more massive than the Moon. This explains how impacts, 
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Figure 1 | Jetting of melted material during an accretionary impact. A time 
series showing a 1% porous projectile 10 km in diameter striking a target at 
3kms_’. Material is coloured according to its temperature at the time shown. 
The origin is the collision site. In a the jet is just beginning to form, 0.3 s into the 
impact. Panel b shows a well-formed hot jet 0.8 into the impact. Panel c 
roughly shows the end of jetting, when the projectile penetrates halfway into the 
target. The fastest ejecta have a velocity of about 6 kms_’, or twice the impact 
velocity. The figure was produced using iSALEPlot. 
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which eject much more solid material than melted material, can produce 
chondrule-rich chondrites. The only material that escapes the target 
bodies is a high-speed mixture of melted surface material (nascent chon- 
drules) and lightly shocked cold proto-matrix (matrix is the material 
in which chondrules are embedded, and shocking refers to the increase 
in material temperature and pressure by passage of a shock wave). Most 
of the lower-speed solid ejecta are retained because of the target bodies’ 
large escape velocity. The ejected chondrules and dust are decelerated to 
low relative velocities by lingering nebular gas, and preferentially accrete 
onto smaller planetesimals, which collectively have a larger surface area 
than more massive bodies”’. 

Chondrule formation stopped sometime around 5 Myr after toa; 
(refs 1 and 22). In our models, large impacts continue to occur even after 
5 Myr. However, chondrules are ejected at velocities above about 
2.5kms ‘ and will have dynamically excited orbits. Without signifi- 
cant gas present to reduce the eccentricities and inclinations of their 
orbits, chondrules will break up as they collide with one another or larger 
bodies at velocities comparable to their ejection velocity of a few kilo- 
metres per second. Astronomical observations suggest that the mean 
lifetime ofa primordial protoplanetary disk is about 3 Myr (ref. 24). Melt 
ejected below escape velocity is bound to the target bodies, which are 
too large to be disrupted by impacts, and must be dynamically removed 
from the asteroid belt”. 

The earliest chondrule-forming impacts occur closer to the Sun and 
the position where chondrule formation occurs moves outward with 
time (Fig. 2). 5 Myr into accretion, the outermost chondrule-forming 
impacts occur within 2.2 au and 3.9 Au, respectively, for the MMSN and 
3MMSN models (the main asteroid belt extends from ~2 AU to 3 AU). 
With masses of 107°-107° kg, the impacting bodies range in size from 
nearly that of Vesta to larger than the Moon (Fig. 2a). The maximum 
impact velocity and Vimp/Vesc increase as time goes on (Extended Data 
Fig. 1). As Vimp/Vesc increases, fractionally more unmelted material is 
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Figure 2 | Timing and location of chondrule-forming impact. Chondrule- 

forming impacts with velocities above 2.5kms_ for the MMSN model (a) and 
the 3MMSN model (b). The points are coloured according to the logarithm of 
the mass of the impacting body marked on the corresponding colour bar. 
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ejected at higher than escape velocity. Thus, our model predicts that 
chondrites formed further away from the Sun should form later and 
on average be richer in matrix than those that formed closer to the Sun. 

Using hydrodynamic scaling and estimates of the melt mass ejected 
above escape velocity as a function of Vimp/Vesc for each of our iSALE models 
(Methods), we estimate the mass of chondrules created by each accre- 
tionary impact occurring in our GAME simulation. The 287 chondrule- 
forming impacts in the MMSN model produce more than 2 X 107” kg 
of chondrules while the 4776 chondrule-forming impacts in the 3 
MMSN model make over 4 X 107° kg of chondrules for the 1% porosity 
case (Fig. 3). Jetted mass is expected to increase significantly for impacts 
on more realistic curved targets”*° and may also increase when oblique 
impacts are considered”*””. The threshold velocity of 2.5kms_ ' required 
to produce chondrules may be an overestimate for the same reasons. A 
lower threshold velocity would produce more chondrules further out 
in the disk, as does using a different size distribution for the initial con- 
ditions of GAME (Methods). Thus, the estimates shown in Fig. 3 probably 
represent a minimum approximation for the total mass of chondrules 
that impacts can produce. 

The present asteroid belt has a mass M,,, = 3 X 107 kg and is depleted 
in mass by a factor of ~1,000 from a MMSN”. This corresponds to a 
numerical depletion factor of 10-100 because most of the mass was con- 
tained in large planetary embryos*. Chondrules will preferentially accrete 
onto smaller bodies, which have a larger collective surface area than more 
massive bodies. Consequently, assuming the main-belt is about one-third 
chondrules by mass, a successful chondrule formation mechanism must 
produce ~3M,,1 to 30M,»1 of chondrules”’. Although dynamical models 
indicate that ~ 10%-30% of the asteroid belt mass may be material orig- 
inally from 1.5-2 au (ref. 25), our MMSN model only makes 8M,» in 
total. However, a 3MMSN model makes ~ 142M pp of chondrules with 
11M,» being made in the main-belt region (2-3 Av). Therefore, even with 
reasonable assumptions our lower limit estimates suggest that impacts 
produce enough chondrules to explain the current chondrule abundance. 

The igneous textures that chondrules exhibit imply that they cooled 
at rates of 10-1,000 Kh’ (refs 9 and 10). Using a one-dimensional radi- 
ative transfer code, anda geometry that approximates that of jetted mate- 
rial (Methods, Extended Data Fig. 3), we determine the average cooling 
rates of impact-produced chondrules for a range of impactor sizes (Fig. 4, 
Methods). Our simulations indicate that jetted material cools at rates of 
10-1,000 Kh! for impactors that are hundreds to thousands of kilo- 
metres in diameter. These estimates demonstrate that melted droplets 
jetted during large-scale accretionary impacts will exhibit the observed 
igneous textures of chondrules. 
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Figure 3 | The cumulative mass of chondrules created by accretionary 
impacts. The red, black and blue curves show model results for 1%, 10%, and 
25% initial porosities. The solid curves correspond to the MMSN model and 
the dashed curves correspond to the 3MMSN model. The grey line acts as a 
guide to the eye, showing the mass of the main asteroid belt, at 3 X 107’ kg. 
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Figure 4 | Chondrule cooling rates as a function of impactor size. The 


coloured lines represent different assumed droplet diameters d within the jet, as 
indicated. The crosses represent actual model runs. 


The chondrules found within a given chondrite type exhibit significant 
variations in composition”. Turbulence within the jet and chondrule- 
chondrule collisions may mix material on scales comparable to the thick- 
ness of the jet. However, the jet is composed of a mixture of target and 
projectile material and the ratio of target-to-projectile material changes 
with time’. If near surface material from the target and projectile have 
significantly different compositions, the chondrules produced by the 
impact will be chemically diverse. Expected spatial variations in chon- 
drule number density could also contribute to the observed composi- 
tional diversity”. 

An important aspect of our model is that, although chondrules accu- 
mulate to form chondrites on small bodies, the chondrules themselves 
are formed by impacts on much larger bodies. Because of this, only a 
small fraction of the mass of the terrestrial planets is processed into 
chondrules. Thus, we argue that chondrules are not the direct building 
blocks of the planets, but merely a byproduct of their accretion. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 16 July; accepted 20 November 2014. 


1. Scott, E.R. D. Chondrites and the protoplanetary disk. Annu. Rev. Earth Planet. Sci. 
35, 577-620 (2007). 

2. Kruijer, T.S. et al. Protracted core formation and rapid accretion of protoplanets. 
Science 344, 1150-1154 (2014). 

3. Fedkin, A.V. & Grossman, L. Vapor saturation of sodium: key to unlocking the origin 
of chondrules. Geochim. Cosmochim. Acta 112, 226-250 (2013). 

4. Alexander, C. M. O. & Ebel, D. S. Questions, questions: Can the contradictions 
between the petrologic, isotopic, thermodynamic, and astrophysical constraints 
onchondrule formation be resolved? Meteorit. Planet. Sci. 47, 1157-1175 (2012). 

5. Alexander, C. M. O., Grossman, J. N., Ebel, D. S. & Ciesla, F. J. The formation 
conditions of chondrules and chondrites. Science 320, 1617-1619 (2008). 

6. Krot,A.N.,Amelin, Y., Cassen, P. & Meibom, A. Young chondrules in CB chondrites 
from a giant impact in the early Solar System. Nature 436, 989-992 (2005). 


LETTER 


7. Johnson, B. C., Bowling, T. J. & Melosh, H. J. Jetting during vertical impacts of 
spherical projectiles. Icarus 238, 13-22 (2014). 

8. Johnson, B. C. & Melosh, H. J. Formation of melt droplets, melt fragments, and 
accretionary impact lapilli during a hypervelocity impact. Icarus 228, 347-363 
(2014). 

9. Lofgren, G. & Russell, W. J. Dynamic crystallization of chondrule melts of 
porphyritic and radial pyroxene composition. Geochim. Cosmochim. Acta 50, 
1715-1726 (1986). 

10. Desch, S. J., Morris, M. A., Connolly, H. C. & Boss, A. P. The importance of 
experiments: constraints on chondrule formation models. Meteorit. Planet Sci. 47, 
1139-1156 (2012). 

11. Collins, G. S., Melosh, H. J. & Ivanov, B. A. Modeling damage and deformation in 
impact simulations. Meteorit. Planet. Sci. 39, 217-231 (2004). 

12. Winnemann, K., Collins, G.S. & Melosh, H. J. A strain-based porosity model for use 
in hydrocode simulations of impacts and implications for transient crater growth 
in porous targets. /carus 180, 514-527 (2006). 

13. Melosh, H. J. /mpact Cratering: a Geologic Process (Oxford Univ. Press, 1989). 

14. Weiss, B. P. & Elkins-Tanton, L. T. Differentiated planetesimals and the parent 

bodies of chondrites. Annu. Rev. Earth Planet. Sci. 41, 529-560 (2013). 

5. Bland, P. A. et al. Volatile fractionation in the early solar system and chondrule/ 
matrix complementarity. Proc. Nat! Acad. Sci. USA 102, 13755-13760 (2005). 

16. Asphaug, E., Jutzi, M. & Movshovitz, N. Chondrule formation during planetesimal 
accretion. Earth Planet. Sci. Lett. 308, 369-379 (2011). 

7. Sanders, |. S. & Scott, E.R. D. The origin of chondrules and chondrites: debris from 
low-velocity impacts between molten planetesimals? Meteorit. Planet. Sci. 47, 
2170-2192 (2012). 

8. Taylor, G.J., Scott, E. R. D. & Keil, K. Cosmic setting for chondrule formation. In LPI 
Conf. on ‘Chondrules and their Origins’ 493, abstract 58, http://adsabs.harvard. 
edu/abs/1983chto.conf..262T (1982). 

9. Mastin, L. G. & Ghiorso, M. S. Adiabatic temperature changes of magma-gas 
mixtures during ascent and eruption. Contrib. Mineral. Petrol. 141, 307-321 
(2001). 

20. Minton, D. A. & Levison, H. F. Planetesimal-driven migration of terrestrial planet 

embryos. Icarus 232, 118-132 (2014). 

21. Morbidelli, A., Bottke, W. F., Nesvorny, D. & Levison, H. F. Asteroids were born big. 
Icarus 204, 558-573 (2009). 

22. Connelly, J. N. et al. The absolute chronology and thermal processing of solids in 
the solar protoplanetary disk. Science 338, 651-655 (2012). 

23. Hood, L. L. & Weidenschilling, S. J. The planetesimal bow shock model for 
chondrule formation: a more quantitative assessment of the standard (fixed 
Jupiter) case. Meteorit. Planet. Sci. 47, 1715-1727 (2012). 

24. Evans, N. J. |. et al. The Spitzer c2d legacy results: star-formation rates and 
efficiencies; evolution and lifetimes. Astrophys. J. 181 (Suppl.), 321-350 (2009). 

25. O’Brien, D.P.,Morbidelli, A. & Bottke, W. F. The primordial excitation and clearing of 
the asteroid belt—revisited. /carus 191, 434-452 (2007). 

26. Melosh, H. J. & Sonett, C. P. When worlds collide—jetted vapor plumes and the 
Moon’s origin. In LPI Conf. on ‘Origin of the Moon’ 1, 621-642, http://adsabs. 
harvard.edu/abs/1986ormo.conf.621M (1986). 

27. Vickery, A. M. The theory of jetting: application to the origin of tektites. Icarus 105, 
441-453 (1993). 

28. Weidenschilling, S. J. Initial sizes of planetesimals and accretion of the asteroids. 
Icarus 214, 671-684 (2011). 

29. Hezel, D.C. & Palme, H. The conditions of chondrule formation, Part |: Closed 
system. Geochim. Cosmochim. Acta 71, 4092-4107 (2007). 

30. Cuzzi, J. N. & Alexander, C. M. O. Chondrule formation in particle-rich nebular 
regions at least hundreds of kilometres across. Nature 441, 483-485 (2006). 


Acknowledgements We thank |. Sanders for his review, which improved the 
manuscript. We gratefully acknowledge the developers of iSALE (www.isale-code.de/ 
projects/iSALE), especially G. Collins, K. Winnemann, D. Elbeshausen and B. Ivanov. 
This research was supported by NASA grant number PGG NNX10AU88G. 


Author Contributions While working on calculating the cooling rates of 
impact-produced chondrules with H.J.M., B.C.J. conceived the idea that chondrules 
could form by jetting during low-velocity accretionary impacts. D.A.M. produced the 
Monte Carlo accretion code results. B.C.J. produced the hydrocode and radiative 
transfer code results. All authors contributed to preparation of the manuscript and the 
conclusions presented in this work. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to B.C_J. (brcjohns@mit.edu). 


15 JANUARY 2015 | VOL 517 | NATURE | 341 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 


Droplet sizes jetting model. Assuming that a model for melt droplet formation in 
ejecta curtains® can be extended to droplet formation in jetted material, we cal- 
culate the size of chondrules with ejection velocities equal to the impact velocity 
(Extended Data Fig. 2). However, jetting ejects material at velocities above the impact 
velocity and higher ejection velocities yield smaller droplets. Consequently, the droplet 
sizes in Extended Data Fig. 2 are the maximum size of droplets produced by jetting. 
For chondrule-forming impacts with impact velocities of 2.5-5.5 kms ' and impactor 
sizes of 100-1,000 km, jetted material will create millimetre-scale droplets consis- 
tent with the observed size of chondrules (Extended Data Fig. 2). 

In terrestrial impact ejecta deposits, larger, centimetre-scale tektites (or melt frag- 
ments) are found along with millimetre-scale melt droplet spherules®, whereas chon- 
drules are all roughly the same size’*. For terrestrial impacts, the initial dynamic 
fragmentation”! of melt forms centimetre-scale droplets*. Upon release from high 
pressure, more highly shocked material or more-volatile-rich material separates 
into vapour and liquid. Acceleration of this vapour-liquid mix produces a differ- 
ential velocity between droplets and the surrounding vapour and the balance of 
inertial forces and surface tension determines the droplet size*. The droplets formed 
in this accelerated two-phase mixture are roughly millimetre-scale for terrestrial 
impact conditions. The volatile-rich nature of chondrule precursor material may 
explain the apparent lack of tektite-like chondrules. Essentially, if any silicates are 
shock-melted there will be enough vaporized volatiles present that the material will 
be best described by a vapour-liquid mixture. 

Droplet sizes splashing model. Droplet size estimates from the splashing model 
for chondrule formation’® assume that the pv component of specific enthalpy 
h=q+t pv (where q is specific internal energy, p is pressure, and v is specific volume) 
is converted to surface energy of droplets with nearly perfect efficiency. Cavitation 
and droplet formation does not occur until p < 0 and when p = 0 the pycomponent 
of enthalpy is also zero. Thus, when ex-solution of gases is ignored, isenthalpic release 
from pressure p causes decompression heating (that is, the pv part of enthalpy is 
converted to internal energy q)’”. Even for a magma releasing to zero pressure from 
200 MPa the associated temperature change is only 50 K (ref. 19). Because there is 
no pv component of enthalpy at the time of fragmentation, the assumption that all 
enthalpy is converted to surface energy is unphysical. The actual size of the ‘drop- 
lets’ is then determined by the balance of inertial forces and tensional forces*'. For 
typical strain rates of €~ 1 h”! (ref. 16), surface tension of ¢ = 0.3Nm_/, density 


p = 3,000kgm * splashing will create ‘droplets’ with diameter d= (400/pé 2)bm 
40 m (refs 8 and 31). 

Hydrocode modelling. Extended Data Table 1 describes the input parameters used 
in our iSALE models. The number of equations of state that accurately represent 
geologic materials under the extreme conditions occurring during impacts is small. 
Here we use equations of state for dunite*’, produced by the ANEOS program”’, to 
approximate the bulk properties of the impactors and target bodies during accre- 
tion. We use dunite strength, thermal softening and porous compaction parameters 
from refs 34-36 and references therein. 

During the jetting process, material is first shocked and then adiabatically 
compressed’. Consequently, we cannot calculate the amount of material melted during 
an impact using the peak pressure. Instead, we use the post-release temperature to 
estimate the amount of melt. We consider any mass that releases to temperatures 
above 1,373 K to be potentially chondrule-forming mass. We use the solidus tem- 
perature because ANEOS does not account for the latent heat of fusion. Hence, 
temperatures above the solidus are exaggerated and cannot be trusted*’. Addi- 
tionally, ANEOS in its current form tends to underpredict the entropy of geologic 
material shocked to a given pressure”. This means that ANEOS tends to under- 
estimate the degree of melting and the temperatures after release. 

For a given impact velocity the total mass of melt ejected above some ejection 
velocity v,j is calculated by summing the mass of Lagrangian tracers with v > v,; 
that also release to temperatures above 1,373 K. For a more detailed description of 
Lagrangian tracers and how ejection velocities are determined see ref. 8. For each 
impactor and target porosity, we created lookup tables for the melt mass ejected at 
greater than escape velocity as a function of impact velocity and escape velocity. These 
tables cover Vimp/Vesc = 0.5-2 stepping by 0.1, and Vimp = 2.5-6 km s stepping 
by 0.5kms~’. Using these tables and hydrodynamic similarity’, we estimate the 
amount of melt created by all of the impacts occurring in the GAME model. For 
impacts with Vimp/Vesc and/or Vimp falling between the data points from our iSALE 
models, we use bi-linear interpolation to estimate the fraction of impactor mass 
that is melted and ejected at higher than escape velocity. 

To check the assumption of hydrodynamic similarity we ran a model with an 
impactor of 1,000 km diameter and found that the amount of melted and jetted mate- 
rial, normalized by impactor mass, did not change. We also made a run with the target 
and impactor initially having intact rock strength. This run was only minimally dif- 
ferent, less than 1%, from the results using an originally damaged impactor and target. 


Thus, we conclude that material strength has a very limited effect on jetting effi- 
ciency at least for impact velocities above 2.5 km s~ '. However, as porosity increases, 
the overall jetting efficiency decreases significantly. This is consistent with jetting 
being less efficient in more compressible materials’. 

Code availability. At present, isALE is not fully open source. It is distributed on 
a case-by-case basis to academic users in the impact community, strictly for non- 
commercial use. Scientists interested in using or developing iSALE should see http:// 
www.isale-code.de/redmine/projects/isale/wiki/Terms_of_use for a description of 
application requirements. The one-dimensional radiative transfer code used here 
is available on request from B. Johnson (brcjohns@mit.edu). GAME is available 
upon request from D. Minton (daminton@purdue.edu). 

Monte Carlo accretion. The minimum mass solar nebula (MMSN) describes a disk 
that is just massive enough to create the observed planets. The MMSN implicitly 
assumes that no mass is lost from the solar nebula during the planet formation pro- 
cess. More massive solar nebulae are often considered by dynamical modellers”. In 
the 3MMSN modela small fraction of impacts have Vimp >6 kms _ land Vimp/Vesc >2- 
In our chondrule mass calculation, we do not include these impacts. This omission 
changes our mass estimate by only a few per cent, an error that is much smaller 
than the uncertainties inherent to our calculation. 

We modelled disks with a main-belt-like size frequency distribution between 
100 km and 1,000 km (ref. 21). This size frequency distribution (SFD) was proposed”* 
because it can produce embryos of larger than the Moon’s mass before the proto- 
planetary disk dissipates (~3 Myr). The early formation of embryos is required to 
explain the mass depletion and dynamical state of the main belt*’. Other models 
show that an initially monodisperse population of planetesimals of 0.1 km diam- 
eter can create larger-than-lunar mass embryos and the observed size frequency of 
main-belt asteroids in just 10° yr (ref. 28). Because GAME tracks the accretion 
history of each object in the model, starting with planetismals of 0.1 km diameter is 
computationally unfeasible. However, we did model disks with a distribution of 
bodies generated from a Gaussian distribution centred at 100 km in diameter with 
a 50-km standard deviation (truncated at 0 km). 

The MMSN model with a Gaussian SED produced 0.86, 0.86, and 1.9 times the 

mass of chondrules made in the MMSN model with a main-belt like SFD, for 1%, 
10%, and 25% porous cases, respectively. The 3MMSN model with a Gaussian SFD 
produced 0.52, 0.45, and 0.22 times the mass of chondrules made in the 3MMSN 
model with a main-belt-like SED, for 1%, 10%, and 25% porous cases, respectively. 
We also found this Gaussian SFD produced chondrules somewhat closer to the 
Sun. For the MMSN models, by 5 Myr the Gaussian SFD produced chondrules at 
1.6 Au from the Sun, whereas the main-belt-like SED produced chondrules at 2.2 au. 
For the 3MMSN models, by 5 Myr the Gaussian SED produced chondrules at 2.4 Au, 
whereas the main-belt-like SFD produced chondrules at 3.9 au. 
Radiative transfer and cooling rates. We find the geometry ofa spherically expand- 
ing plume, as used by ref. 40, is unrealistic for jetted melt. To estimate the cooling 
rates of impact-produced melt droplets, we model the ejected material as an infinite 
sheet with a density that decreases with time. We assume this geometry so we can 
use a one-dimensional radiative transfer code to determine the cooling rates. Our 
radiative transfer code uses the diffusion approximation*’. We benchmarked the 
code using the non-equilibrium Marshak diffusion problem. Extended Data Fig. 3 
schematically shows our assumed geometry, where 


Rin(t) =Rin(to) + Vint 


Rout (t) = Rout (to ) + Vout 


We consider the case where Vin = Vimp = 3 km s )and Vou = 3.5kms |. At 2.5s into 
the impact ofa 10-km-diameter projectile, Ri, = 8.8 km and Rout = 9.5 kim. This region 
has an average density of pj, = 800 kgm’ * and an average thickness of h = 215m. 
The7 X 10'*kg of material represents 48% of the total melt mass ejected with veloc- 
ities above 3kms_'. This is the result of lower-velocity material dominating the 
mass of the jet’. When considering larger impactor sizes we use hydrodynamic 
scaling to produce different initial conditions. 

We focus on the mass-averaged cooling rate of this material as an estimate for the 
average cooling rate of chondrules created by jetting. If we were to consider faster 
parts of the plume, we would expect higher cooling rates; slower ejecta would have 
lower cooling rates. 

We assume the heat capacity of the droplets to be 1,000 J kg’ K~' and the bulk 
density of the droplets is Parop = 3,000 kgm *. The droplets are assumed to be black- 
bodies and thus have a collective opacity of « = 3$/4rdrop, where ¢ is the fraction 
of the volume occupied by the droplets and rg;op is the radius of the of the melt 
droplets. For simplicity, we neglect the opacity of any vapour that may be present, 
that is, impact-produced vapour or gas in the solar nebula. We assume that the drop- 
lets start out with temperatures of 2,000 K. We also set the constant temperature 
boundary condition to 300 K. 
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The opacity is updated using the volume calculated at each time step: 


V(t) =n(R?,,—R?,) dh 


out 


where dh = h/800 is the thickness of one of 400 equal-sized computational cells. 
Note that for the reflective boundary condition and symmetry of the problem we 
model only half of the thickness of the jet. Then the volume fraction occupied by 
molten droplets is: 

p drop V 


where 


Mo = Pye V (to) 


In our approximation of the geometry of a jet, radiation escapes only from the 
free surface of the jet. We find for impactors 100-1,000 km in size that the cooling 
of the jet takes hours to days (Fig. 4). The fastest part of the jet moves a few kilom- 
etres per second faster than the parts that only just escape the target body. Thus, the 
jet has a horizontal extent of 10*-10° km on the timescale of cooling (estimated by 
multiplying a few kilometres per second by an hour toa day). The distance over which 
radiation must diffuse is much larger than the thickness scale, which is 2-20 km 
for impactors that are 100-1,000 km in diameter. Additionally, the free surface is 
much colder (here assumed to be 300 K) when compared to the temperature of 
adjacent material along the jet. Thus, our assumption that radiative transfer along 
the jet is negligible is quite reasonable. 

Extended Data Fig. 4 shows the results of our radiative transfer code, assuming 

a droplet diameter of 1 mm and an impactor diameter of 1,000 km. Extended Data 
Fig. 4 shows that the maximum mass-averaged cooling rate, which is plotted in Fig. 4, 
occurs when the mass-averaged temperature is ~ 1,400 K. The plots also show that 
the outer part of the jet cools at about ten times the rate of the inner part of the jet. 
The outer part of the jet is not to be confused with the faster-moving part of the jet 
(see Extended Data Fig. 3), although faster-moving parts of the jet probably have 
higher cooling rates too. 
Dust enrichment and chondrule-chondrule collisions. Turbulent mixing of neb- 
ular gas and jetted material lead to chondrule-chondrule collisions. In the jet the 
turbulent velocity is approximately a few per cent of the flow velocity, which is the 
product of the jet thickness and local velocity gradient*. This velocity is indepen- 
dent of impactor size and our isALE models indicate that the turbulent velocity of 
the jet is Vu» ~5ms '. However, the turbulent velocity may be a few per cent of 
the ejection velocity, Viurp ~ 100 ms~', ifturbulence created at the jet-nebula inter- 
face is dominant. The relative velocities of inertial particles in turbulent two-fluid 
flow is”: 


is =fp 
Vrel = Vturb (1 +1.5Tp a) 
turb 


where |i» is the turbulent length scale taken to be approximately equal to the jet 
thickness of 2.2-22 km for impactor diameters of 100-1,000 km. In the midplane 
at around 3 AU, tp = 4.9 X 10°s for a 1-mm-diameter particle, with longer times 
occurring outside the midplane’. This equation is valid for inertial or heavy part- 
icles that have tp > ub / tub and when the collisional mean free path is larger than 
the correlation length“. In a turbulent flow, the correlation of velocity along a line 
connecting two points separated by / is’: 


2.2 
0.9313 
f= 0 1— 2 ) 


turb 


where dissipation rate e~v;.,, / Iu. When f = 0 the velocities are uncorrelated; f = 1 
means that the velocities at the two points are identical. The minimum value of 4 
that yields f = 1 is the correlation length. We find that at early times the collisional 
mean free path J is much smaller than the correlation length. We therefore intro- 
duce the following term to account for particles having initially partially correlated 


velocities: 


turb 


1 
Vturb ~2 
Veel = Vturb (141.5% = ) (1-f) 


Assuming a collision efficiency of unity, the average number of collisions a single 
chondrule experiences per unit time is*?: 


1 
N~ 4120? VreNc 
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-1 
so that the mean free path = (4nid?nc) . Extended Data Fig. 4 shows the 


results for number density, relative velocity, rate of collisions and cumulative 
number of collisions calculated at one-second intervals and using the bulk density 
coming from the radiative transfer sections. We note that the turbulent velocity is 
assumed to be constant, although it probably decays with time. 

While the drops are still above their solidus, the collision velocities shown in 
Extended Data Fig. 4b are low and could result in coalescence or bouncing”. As time 
goes on the relative velocities increase, but partially molten, cooling chondrules can 
survive impacts at velocities up to 100 ms_' (ref. 43). Although our simple calcula- 
tions show that individual chondrules may experience many collisions (Extended 
Data Fig. 4c, d), it is unknown what fraction of chondrule collisions will result in 
the formation of compound chondrules, and the fraction of compound chondrules 
provides only a minimum estimate of the chondrule concentrations*”’. 

Because we do not include nebular gas in our hydrocode models, the pressure in 
the jet drops below typical nebular pressures of ~0.01-100 Pa (ref. 3). However, 
even at background nebula pressures the dust enrichment of the jetted material is 
enough to explain the volatile content of chondrule olivine. If chondrules formed 
at typical nebular pressures, dust enrichments greater than 10° (where dust enrich- 
ment is the solid-to-vapour ratio relative to a system of solar composition**) are 
needed to explain the volatile content of chondrule olivine’ *. For 1-mm-diameter 
chondrules with densities of 3,000 kg m ° this corresponds to number densities 
higher than 10°-10* m7? depending on the total pressure‘. For the geometry of the 
jet described in the radiative transfer section and for a 100-km-diameter impactor, 
the number density will remain above 10*m ° until 1.2 h after ejection and drops 
below 10° m after 3.9 h (Extended Data Fig. 4a). This time scales linearly with impac- 
tor size, meaning that the bulk density of material ejected by a 1,000-km-diameter 
body remains above 10‘ m~* for about 12 hand above 10° m7 ° for 39 h. A compar- 
ison of Extended Data Fig. 4 and Fig. 4 shows that this is more than enough time for 
chondrules to cool below the solidus at ~1,400 K. Our model also predicts that 
chondrules that experience higher cooling rates (Extended Data Figs 4 and 5) will 
cool to the solidus at higher dust enrichments than those made by the same impact 
with lower cooling rates. 

Both dust enrichment and total vapour pressure are important for determining 
the volatile content of chondrule olivine’. To obtain more robust estimates of the 
time history of dust enrichment, the rate of chondrule-chondrule collisions, and 
the total vapour pressure, the dynamic interaction of jetted material with nebular 
gas is required. This requires a two-fluid hydrodynamic code that can accurately 
model the interactions of particles and gas. 
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Extended Data Figure 1 | Timing and velocity of chondrule-forming 
impact. Chondrule-forming impacts with velocities above 2.5kms_ for the 
MMSN model (a) and the 3MMSN model (b). The points are coloured 
according to Vimp/Vesc (Shown on the colour scale). Note that Vimp/Vesc may be 
less than one because v-s- is considered to be the escape velocity after the target 
and impactor have combined to form a more massive body. 
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Extended Data Figure 2 | Maximum size of droplets created by jetting. The 
different lines represent different impact velocities, as indicated. 
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Extended Data Figure 3 | Schematic showing the geometry of our radiative 
transfer models. The horizontal axis shows radial distance from the point 
ofimpact. The vertical axis marks the thickness of the jet. We model a portion of 
the jet as an annulus that moves outward radially. The width of this annulus also 
grows with time. BC, boundary condition; h, the thickness of the jet. 
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Extended Data Figure 4 | Temperature time history for a jet consisting of at 300 K. The thick grey curve is the mass-averaged temperature, which we use 
1-mm-diameter droplets created by a 1,000-km-diameter impactor. The as proxy for the average temperature of material in the plume. Panel a shows 
different coloured curves represent different computational cells, as indicated, _ the temperature as a function of time, while b shows the cooling rate as a 
where cell 1 is the innermost cell, which has a reflective boundary condition function of time. 

on one side, and cell 400 is the outermost cell, which radiates into a background 


©2015 Macmillan Publishers Limited. All rights reserved 


a b 


Vy = VS inp = 100 km 

—v, _=100m/sD_ = 100 km 
turb imp 

—v, ,=5m/sD_ = 1000 km 
tur imp 

—v, _=100m/sD,_ = 1000 km 
turb imp 


—D,_ =100 km 
imp 

—D._ =1000 km 
imp 


Number density (m7) 
Collision velocity (m/s) 


-2 -1 oO 1 2 -3 -2 —1 oO 1 2 


10 10 10 10 10 10 10 10 10 10 10 
Time (hr) Time (hr) 


——Vryry = WS a = 100 km 
=100 m/sD_ = 100 km 
turb imp 
=5m/sD. = 1000 km 
ur imp 


Vy =100 m/s Din = 1000 km 


—v_,=5msD._ =100 km 
turb imp 


—v, ,=100m/sD_ = 100 km 
turb imp 


—v, ,=5m/sD_ = 1000 km 
jurb imp 


Collision rate (hr-') 
Number of collisions 


t 


—v, ,=100m/sD, = 1000 km 
tur imp 


b 


10" -3 2 1 oO 1 2 10° -3 2 -1 oO 1 2 
10 10 10 10 10 10 10 10 10 10 10 10 
Time (hr) Time (hr) 
Extended Data Figure 5 | Chondrule density and collision rates. velocity plotted as a function of time. c, Rate of collisions a single chondrule 
a, The number density of chondrules is plotted as a function of time for experiences plotted as a function of time. d, Cumulative number of impacts a 
100-km-diameter and 1,000-km-diameter impactors. b, Relative collision chondrule experiences plotted as a function of time. 
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Extended Data Table 1 | iSALE input parameters 


* See ref. 31. 


Description Input 


Melting temperature 1373 K 


Simon A parameter 1520 MPa 


Poisson’s ratio v 0.25 


Frictional coefficient (undamaged) p° 1.58 


Cohesion (damaged) Yo° 10 kPa 


oO 
oO 
—_ 


Strain at which porous compaction begins e, 


Size of high resolution cell 12.5m 


Number of high resolution cells vertical direction 1600 


>See ref. 46 and references therein for a description of Simon parameters, the thermal softening parameter, and their implementation in iSALE. 
°See ref. 11 and references therein for a description of strength model parameters and their implementation in iSALE. 
4 See refs 12 and 47 for a description of the porous compaction model parameters and their implementation in iSALE. 
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Direct observation of electron propagation and 
dielectric screening on the atomic length scale 


S.N eppl>, R. Ernstorfer®, A. L. Cavalieri*>»°, C. Lemell’, G. Wachter’, E. Magerl’, E. M. Bothschafter®, M. J obst!?, 
M. Hofstetter’, U. Kleineberg”*, J. V. Barth!, D. Menzel’, J. Burgdorfer””, P. Feulner', F. Krausz”* & R. Kienberger’? 


The propagation and transport of electrons in crystals is a funda- 
mental process pertaining to the functioning of most electronic de- 
vices. Microscopic theories describe this phenomenon as being based 
on the motion of Bloch wave packets’. These wave packets are su- 
perpositions of individual Bloch states with the group velocity deter- 
mined by the dispersion of the electronic band structure near the 
central wavevector in momentum space’. This concept has been ver- 
ified experimentally in artificial superlattices by the observation of 
Bloch oscillations*—periodic oscillations of electrons in real and 
momentum space. Here we present a direct observation of electron 
wave packet motion in a real-space and real-time experiment, on 
length and time scales shorter than the Bloch oscillation amplitude 
and period. We show that attosecond metrology’ (1 as = 10° '® sec- 
onds) now enables quantitative insight into weakly disturbed elec- 
tron wave packet propagation on the atomic length scale without 
being hampered by scattering effects, which inevitably occur over 
macroscopic propagation length scales. We use sub-femtosecond (less 
than 107 seconds) extreme-ultraviolet light pulses* to launch photo- 
electron wave packets inside a tungsten crystal that is covered by mag- 
nesium films of varied, well-defined thicknesses of a few angstréms*. 
Probing the moment of arrival of the wave packets at the surface 
with attosecond precision reveals free-electron-like, ballistic propa- 
gation behaviour inside the magnesium adlayer—constituting the 
semi-classical limit of Bloch wave packet motion. Real-time access 
to electron transport through atomic layers and interfaces promises 
unprecedented insight into phenomena that may enable the scaling 
of electronic and photonic circuits to atomic dimensions. In addi- 
tion, this experiment allows us to determine the penetration depth 
of electrical fields at optical frequencies at solid interfaces on the 
atomic scale. 

A detailed microscopic understanding and control of electronic and 
optical properties of solids depends on our ability to access the dynam- 
ics of electrons on atomic time and length scales. Tracking the pro- 
pagation of electrons in real time requires the ability to pinpoint their 
position at a rate comparable to the time on which interactions with 
other electrons and the crystal lattice may affect their trajectories inside 
the material. In a classical picture, the upper bound for the necessary 
temporal resolution is therefore the time it takes the electrons to travel 
the several-angstrom distance between neighbouring atoms. This implies 
transit times well below one femtosecond even for kinetic energies as 
low as Exin ~ 1 eV. Previous time-resolving studies on electron pro- 
pagation in condensed matter employed laser-based spectroscopic 
techniques to reveal ballistic currents and drift motion of charge car- 
riers’ *. Restricted to the pico- and femtosecond timescales, they are 
able to probe carrier dynamics averaged over several hundreds of nano- 
metres. In contrast, the experimental approach demonstrated here offers 


quantitative real-time access to electron transport on the inter-atomic 
length scale. 

Figure 1 illustrates the basic principle of the experiment. Attosecond 
extreme-ultraviolet (XUV) light pulses generate Bloch electron wave 
packets with final-state energies substantially above the vacuum level. 
Electron wave packets with a sufficiently large momentum component 
along the surface-normal direction z contribute to the photoelectron 
current reaching the time-of-flight detector. Conventional photoelec- 
tron spectroscopy is restricted to measuring the energy and momen- 
tum distributions of photoelectrons. Here we additionally capture the 
temporal profile of the photoemission process by having the ejected 
electrons interact with the controlled few-cycle electric field ofa visible/ 
near-infrared (henceforth referred to as “NIR’) laser pulse covering the 
500-1,000 nm spectral range and synchronized, with attosecond pre- 
cision, to the XUV pulse. 

As this field modifies (‘streaks’) the momentum of a photoelectron 
in proportion to the laser vector potential A;(z, t) at the instant ¢ the 
electron enters the NIR field**"®, the temporal profile of the electron wave 
packet leaving the sample is mapped onto its final momentum distri- 
bution. Full streaking spectrograms obtained by recording these laser- 
modulated electron energy distributions as a function of delay At between 
the XUV and the NIR pulses are therefore highly sensitive to the spatio- 
temporal characteristics of both the photoelectron wave packet and the 
streaking laser field on the angstrém length scale and the attosecond 
timescale?""’. 

Direct (time-domain) access to these electronic and optical wave pack- 
ets promises a unique insight into the photoelectric effect, including 
underlying electron propagation and phenomena as fundamental as 
dielectric screening of light fields at solid surfaces. Here we show that 
this can be achieved by combining state-of-the-art attosecond timing 
metrology (chronoscopy)**""' with sample engineering on the angstrém 
level*””. 

When excited from Bloch states inside the crystal to positive-energy 
states'’, photoelectrons are not immediately exposed to the streaking 
field because the electric field amplitude is substantially reduced (or 
screened) at the topmost layer by the response of the metal electrons. 
Therefore, the time delay associated with the propagation of the re- 
spective Bloch wave packets towards the surface (included in both the 
quantum-mechanical one-step’ and the semi-classical three-step'*”° 
description of photoemission) is encoded in the streaking spectrogram”. 
Differences in the propagation times of electrons ejected from differ- 
ent initial states manifest themselves as a temporal offset between the 
respective streaking traces’’* (see Fig. 1a). Previous studies on single 
crystals revealed a considerable time delay between the emission of core- 
level and conduction band (CB) photoelectrons from the transition 
metal tungsten W(110)’, whereas such a delay was found to be absent 
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Mg/W(110) 


Figure 1 | Spatio-temporal dynamics in attosecond photoemission from 
Mg/W(110). a, Principle of the experiment: photoelectrons (green dots) are 
launched inside a tungsten W(110) crystal and a magnesium (Mg) overlayer a 
few angstroms thick by an XUV pulse of ~450 as, and are detected in ultrahigh 
vacuum with a time-of-flight analyser. At the surface, the arrival times of 
electrons released from different initial states are probed by streaking their 
associated electron energy distributions with a 2 X 10'' W cm ~ strong electric 
field delivered by a sub-5 fs broadband linearly polarized visible/near-infrared 
laser pulse. Relative time delays At developing during the propagation of 

the photoelectrons to the metal-vacuum interface are detected as temporal 
shifts between their streaked energy distributions. The time shifts At are 
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sensitive to the atomic-scale electron transport characteristics (quantified by 
the inelastic mean free path 4; indicated only for the W 4f electrons), the Mg 
overlayer thickness and the screening behaviour of the laser field at the 
solid-vacuum interface. b, Schematic energy-level diagram for the probed 
electronic transitions. The central XUV photon energy of ~118 eV allows 

the simultaneous excitation of Mg 2p, W 4f and the joint CB states (binding 
energy E,in as indicated). A background-corrected photoelectron spectrum of 
n= 4 adlayers of Mg on W(110) in the absence of the laser field is shown as 
the black solid line. For better visibility, the strength of the CB and W 4f signals 
are magnified by a factor of six. 


in the photoemission from the free-electron metal magnesium Mg(0001)”°. 
Theoretical models have addressed different contributions such as may 
arise from the band structure of the material*’®"’, the spatial charac- 
teristics of the initial-state wavefunctions'*’, and elastic and inelastic 
scattering effects””*. These models also differ from each other in the way 
that the screening of the laser field at the surface is taken into account 
when calculating the photoemission time delays'*’*”’. To isolate the 
atomic-scale electron propagation process from this multitude of dis- 
parate effects, we investigate hybrid metallic samples consisting of a 
controllable number n of Mg adlayers on a W(110) crystal'*”* (see Fig. 1a 
and Supplementary Information for details) and contrast the measured 
time shifts with electron transport calculations. 

In our experiments, XUV pulses with a duration of about 450 as and 
a photon energy of imyyy = 118 eV simultaneously generate photo- 
electrons from core states of the substrate (W 4f) and adlayer (Mg 2p), 
as well as from the energetically overlapping CB states of both materials 
(Fig. 1b). A representative streaking spectrogram for n = 4 Mg adlayers 
on W(110) is shown in Fig. 2a. Despite the ~80% attenuation of the W 


Figure 2 | Attosecond time-resolved photoemission from Mg/W(110). 

a, Representative streaking spectrogram for n = 4 Mg monolayers (ML). All 
photoelectron spectra are corrected for the inelastic electron background signal. 
The strength of the CB and W 4f signals is magnified by a factor of 6 for better 
visibility. b, Exemplary timing analysis of the Mg 2p and W 4f core-level 
electrons: the first moments calculated from their respective kinetic energy 
distributions are shown as red crosses (Mg 2p) and blue crosses (W 4f) as 
functions of NIR-XUV delay At. A global fit of the resultant streaking traces to 
a parameterized waveform for the NIR vector potential (solid lines) reveals a 
relative time shift At[4f- 2p], which can be identified with the time delay 
occurring during the release of the electrons from the metal surface. Insets 
illustrate the evolution of At[4f-2p] for 0<n=4. Regions exhibiting the 
largest gradient of the streaking field (corresponding to the highest temporal 
resolution) are highlighted. An analogous evaluation of At[CB - 2p] is 
presented in the Supplementary Information. 
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substrate photoemission due to inelastic scattering in the Mg overlayer, 
the streaked W 4f and CB photoemission lines are clearly discernible. 
They are also sufficiently separated from each other and from the Mg 2p 
line over the entire range of XUV-NIR delays, At, to guarantee an accu- 
rate quantitative analysis of their relative emission dynamics” ''. In what 
follows, we reference the emission times of the W 4f and CB electrons 
to the Mg 2p emission from the Mg overlayer and denote the resultant 
relative delays as At[4f- 2p] and Az[CB - 2p], respectively. 

We begin with an analysis of At[4f- 2p] because the involved photo- 
electrons originate from atomic-like states that are entirely localized in 
the W(110) substrate or the Mg overlayer. This allows unambiguous 
interrelation of the measured time shifts and the well-defined propaga- 
tion distances in the Mg adlayer systems, which may not perfectly apply 
to CB electrons owing to the delocalized character of the CB initial-state 
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wavefunctions'*”’, Relative time shifts At[4f-2p] extracted with a robust 
quantum-mechanical fitting scheme"! (see Supplementary Informa- 
tion) from streaking spectrograms at different Mg coverages are sum- 
marized in Fig. 3a (blue diamonds). They reveal a distinct monotonic 
increase of At[4f- 2p] with the number n of Mg adlayers, reaching 
At[4f- 2p] = 215 + 20as for a Mg film 10.4 A (n= 4) thick. We veri- 
fied this trend using a simpler analysis that compares the first moments 
of the streaked W 4f and Mg 2p energy distributions as functions of At 
(Fig. 2b; see Supplementary Information). 

The simplest description of the electron propagation is to consider 
ballistic motion of the centroid of a Bloch wave packet as a free point- 
like electron in one dimension”. For the Mg layers, At[4f- 2p] is then 
dominated by the average propagation time ty, of the 4f-derived wave 
packets with central wavevectors k = kg travelling at a group velocity 


Vif = a |,=k, Owing to the free-electron-like band-structure of 


Mg’®”*, we can assume that v4r~ \/2Exin/me, where m, is the free elec- 
tron mass. The kinetic energy of the photoelectrons inside the Mg layer 
amounts to Exin ~ 93 eV (the photon energy is A@yyy = 118 eV, the 
binding energy of the 4f electrons is E, ~ 32.5 eV, the Fermi energy of 
bulk Mg is Exermi~ 7eV), leading to an average group velocity of 
Vap~ 0.057 Aas |. We therefore expect t4y—which is the upper limit 
for At[4f-2p]—to increase almost linearly with the number of Mg 
adlayers n, according to tap~ n X d/v4g~ n X 45 as, where d = 2.6 Ais 
the interlayer spacing of the epitaxial Mg films’. A linear fit to the 
experimental data of Fig. 3a yields a delay of ~42 as per adlayer. The 
good agreement between experiment and model prediction provides 
conclusive evidence for the atomic-scale ballistic propagation of the 4f 
electrons being the microscopic origin of the observed time shifts in the 
spectrograms and also corroborates free-electron-like transport in Mg. 

This interpretation is substantiated by electron transport simulations” 
of the ballistic motion of the W 4f, Mg 2p and CB electrons in the Mg/ 
W(110) systems. In general, time delays obtained from such transport 
calculations are sensitive to (1) the average group velocities of the elec- 
trons at the relevant energies, (2) their energy-dependent inelastic mean 
free path A(E,;,,) in the traversed materials and (3) the spatio-temporal 
profile of the streaking field near the surface’. The average group veloc- 
ities can be deduced from electronic structure calculations” and all 
relevant values of A(E,in) are known from synchrotron experiments”® 


Figure 3 | Atomic-scale photoelectron transport and screening of the 
incident light field. a, Time delays At[4f- 2p] between the release of W 4fand 
Mg 2p electrons extracted from a large set of streaking spectrograms with 
different numbers of Mg adlayers are shown as blue diamonds. Error bars 
denote full standard deviations and are obtained by averaging measurements 
performed under similar experimental conditions. Fractional adlayers 
correspond to dispersed two-dimensional islands (on top of a completed Mg 
layer) that coalesce upon further Mg deposition. Green lines are time delays 
predicted by our one-dimensional (1D) simulation of the photoelectron release 
dynamics for different screening lengths 6 assuming an exponentially decaying 
normal component of the incident NIR streaking field at the metal-vacuum 
interface according to E,(z, t)=Eye~ *~*")/°. The light-green-shaded area 
highlights the screening scenarios compatible with the experiment. Red squares 
indicate time delays derived from a full three-dimensional electron transport 
model (the red line is a guide to the eye). b, The upper panel illustrates the 
different screening scenarios for E,(z, t) considered in a for the example of 
n= 2 Mg adlayers (orange shading) on W(110) (grey shading). The red line is 
the spatial variation of E,(z, t) at the interface predicted by TDDFT. The lower 
panel is a snapshot of the NIR-induced charge density at the metal-vacuum 
interface at the maximum of the laser pulse derived by TDDFT (magenta line). 
The position of the dynamic image plane Zim is indicated as a vertical black 
solid line. The lattice potential (averaged parallel to the crystal surface) 
employed in the DFT calculations is shown as a dotted black line. The positions 
of the Mg and W atoms at the interface along the surface normal are indicated 
as black dots. The gradual transitions from the W electronic structure 

(grey) to the Mg electronic structure (orange) and to the vacuum (white) 

are indicated in the background colours. c, Comparison of time shifts At[CB - 
2p] measured between the emission of CB and Mg 2p electrons with time 
delays predicted by the three-dimensional (3D) electron transport model. 
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or theory”. Asa consequence, our experiments open up the opportunity 
to explore the spatial variation of the laser field’s F(z) component normal 
to the surface (z axis) on the angstrém length scale. 

Screening at metallic surfaces becomes effective near the so-called 
image plane Z;,,; located about a half-layer spacing outside the centre of 
the topmost atomic layer**”’. As our method probes the optical near-field 
at the metal-vacuum interface, the commonly used Fresnel equations 
based on macroscopic properties of target components with perfectly 
sharp interfaces cannot be applied. Instead, a phenomenological expo- 
nential decay E,(z, t) = Eye %~*™?/° of the surface-normal component 
of the field inside the material appears to be a reasonable assumption’*”’. 
We therefore modelled the impact of different screening lengths 6, that 
is, the length scale on which the stepwise prediction of the Fresnel for- 
mula does not apply, on the time delays At[4f- 2p] using one-dimensional 
electron transport simulations (see Supplementary Information). The 
time delays At[4f- 2p] predicted by this simple model as a function of 
nand 6 are plotted as green lines in Fig. 3a. Apparently, only the range 
0<6< 3A iscompatible with the experiment and the associated error 
bars, indicating a screening within one atomic layer of Mg. 

To scrutinize the origin of this rapid interfacial screening, we used 
time-dependent density functional theory (TDDFT)” to calculate E;,(z, t) 
for the Mg/W(110) systems. The surface-normal component of the inci- 
dent laser field induces a polarization charge layer at the metal surface 
that shields the interior of the solid against the external electric field. 
The centroid of the induced screening charge density defines the exact 
position Z;,, of the image plane, which marks the microscopic onset of 
the local screening process” (see Fig. 3b). Both the positions z,,, , and 
the width 6,, of the induced screening charge are found to be almost 
independent of the number n of Mg layers. The key finding is that the 
laser field is already fully screened at the plane defined by the centre of 
the atoms of the topmost layer for all Mg/W(110) systems, in agreement 
with the conclusion drawn from the comparison of our experimental 
data with the phenomenological modelling. 

Finally, we incorporated the abrupt screening of the streaking field 
at the surface revealed by TDDFT in a full three-dimensional streaking 
simulation” of the electron propagation in Mg/W(110) (see Supplemen- 
tary Information for a detailed description). Similar to the above-mentioned 
one-dimensional model, wave packet propagation is simulated by the 
transport ofan ensemble of point-like charges taking stochastic inelastic 
and elastic scattering events into account. The time delays At[4f- 2p] 
predicted by these calculations (red squares in Fig. 3a) are in good agree- 
ment with the experiment. 

Compared to the core-level photoemission time delay At[W 4f- 2p], 
the temporal shift of the conduction band emission At[CB - 2p] (Fig. 3c) 
is distinctly smaller and exhibits a strikingly different dependence on 
the number of Mg adlayers. A detailed analysis of At[CB - 2p] within 
electron transport models is complicated by different (spectrally unre- 
solved) contributions of W(110)- and Mg-derived states to the joint CB 
feature at E,jn ~115 eV. However, by weighting the excitation proba- 
bilities from these different initial states according to their atomic photo- 
excitation cross-sections (see Supplementary Information), we achieved 
good overall agreement with the experimental results, and correctly repro- 
duced the vanishing At[CB - 2p] time delay for bulk Mg”°. This suggests 
that our approximate treatment of the Bloch wave packet propagation 
and the dielectric screening response remains valid also for more delo- 
calized initial electronic states, in contrast to recent predictions”. 

We emphasize that At[CB - 2p] for n = 1 is overestimated in our trans- 
port model and lies outside the experimental error margin. A deviation 
from the present model appears likely for the Mg/W(110) monolayer 
system, since strong mixing of band states at the interface may lead toa 
deviation of the initial-state band structure and excitation cross-sections 
from their bulk characteristics™*. A detailed discussion of this phenom- 
enon is beyond the scope of the present study, but indicates the potential 
of attosecond photoelectron spectroscopy to probe interfacial hybrid- 
ization between electronic states directly in the time domain. Therefore 
this technique may be applied to phenomena not describable by a 
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simple combination of separate electronic structure models for bulk and 
surface layers. 

This work extends the realm of attosecond spectroscopy to the direct 
observation of atomic-scale propagation and damping of electronic and 
optical wave packets at solid surfaces. The resultant insight into attosecond 
temporal and—simultaneously—angstr6m spatial dimensions opens 
the door for understanding and exploring electron transport phenom- 
ena on the atomic scale and the dielectric response of solids at optical 
frequencies. Applied to overlayer materials with non-free-electron-like 
positive-energy states, such studies will shed light on whether stationary 
band structure can be used to predict atomic-scale electron propagation 
on ultrashort timescales. Extrapolation of coverage-dependent streak- 
ing spectroscopy to the sub-monolayer regime will provide access to 
absolute photoemission times and possible intrinsic (atomic) retarda- 
tion effects in the photoemission process. Beyond addressing these 
fundamental questions, attosecond electron transport chronoscopy 
may prove instrumental in advancing electronic and photonic circuits 
towards atomic dimensions. 
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Modulation of hydrophobic interactions by 
proximally immobilized ions 


C. Derek Ma!, Chenxuan Wang’, Claribel Acevedo-Vélez!, Samuel H. Gellman? & Nicholas L. Abbott! 


The structure of water near non-polar molecular fragments or sur- 
faces mediates the hydrophobic interactions that underlie a broad 
range of interfacial, colloidal and biophysical phenomena’ ~*. Sub- 
stantial progress over the past decade has improved our understand- 
ing of hydrophobic interactions in simple model systems’*°, but most 
biologically and technologically relevant structures contain non-polar 
domains in close proximity to polar and charged functional groups. 
Theories and simulations exploring such nanometre-scale chemical 
heterogeneity find it can have an important effect*"* ”, but the influ- 
ence of this heterogeneity on hydrophobic interactions has not been 
tested experimentally. Here we report chemical force microscopy mea- 
surements on alkyl-functionalized surfaces that reveal a dramatic 
change in the surfaces’ hydrophobic interaction strengths on co- 
immobilization of amine or guanidine groups. Protonation of amine 
groups doubles the strength of hydrophobic interactions, and gua- 
nidinium groups eliminate measurable hydrophobic interactions in 
all pH ranges investigated. We see these divergent effects of prox- 
imally immobilized cations also in single-molecule measurements 
on conformationally stable b-peptides with non-polar subunits located 
one nanometre from either amine- or guanidine-bearing subunits. 
Our results demonstrate the importance of nanometre-scale chem- 
ical heterogeneity, with hydrophobicity not an intrinsic property of 
any given non-polar domain but strongly modulated by functional 
groups located as far away as one nanometre. The judicious placing 
of charged groups near hydrophobic domains thus provides a strat- 
egy for tuning hydrophobic driving forces to optimize molecular 
recognition or self-assembly processes. 

To quantify the hydrophobic component of interactions between two 
chemically defined surfaces, we compare adhesive forces measured in 
either aqueous triethanolamine (TEA, 10 mM) buffer or a mixture of 
40 vol% aqueous TEA buffer and 60 vol% methanol. We measured the 
mean adhesive (‘pull-off) force between the tip (gold-coated and func- 
tionalized with a monolayer of C,»H,,SH) ofan atomic force microscope 
(AFM) anda C,9H,9SH monolayer formed ona gold film (Figs 1a, b and 
2a) to be 4.0 + 0.3 nN in the aqueous TEA buffer, independent of pH 
between 7 and 10.5 (Fig. 2a, b and Extended Data Figs 1, 2). Johnson- 
Kendall-Roberts theory’’ and measurements of the work of adhesion 
(obtained from contact angle measurements and use of Young’s equa- 
tion'*) confirmed the magnitude of the measured force to be consistent 
with previous reports of hydrophobic interactions between non-polar 
surfaces (Methods). Addition of 60 vol% methanol to the aqueous TEA 
buffer reduced the measured adhesive force between the AFM tip and 
the C}9H9SH monolayer from 4.0 + 0.3 to 2.1 + 0.2 nN, consistent with 
past studies that have shown that addition of methanol to aqueous so- 
lutions disrupts the structure of water near non-polar surfaces and di- 
minishes or eliminates hydrophobic interactions'*’. The adhesive force 
measured in pure methanol is 1.8 + 0.2 nN (Fig. 2b), indicating that 
addition of 60 vol% methanol to aqueous TEA eliminates approximately 
85% of the total hydrophobic adhesion. The same methanol proportion 
was reported to disrupt hydrophobically driven association of amphi- 
philic oligopeptides’*’”. Although the force of 2.1 + 0.2 nN measured 


between the AFM tip and the C,)H,9SH surface in 60 vol% methanol 
(Fig. 2b) thus contains some residual hydrophobic interactions (0.3 + 
0.2 nN; see Methods), it arises mainly from van der Waals interactions 
(1.8 + 0.2 nN). 

We next used 60 vol% methanol addition to identify hydrophobic 
components of the adhesive interaction between an alkyl-terminated 
AFM tip and either AmC, ;H2.SH monolayers or mixed monolayers com- 
prised of ammonium (Am)- and alkyl-tipped thiols (40% AmC,,H,.SH 
and 60% C; 9H, SH; Fig. la and Extended Data Fig. 3). Adhesion in- 
volving the pure AmC,,;H2.SH monolayers (Fig. 3a, c and Extended 
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Figure 1 | Experimental systems used to investigate the effects of 
immobilized charge on hydrophobic interactions. a, Single-component and 
mixed, self-assembled monolayers presenting alkyl, Am and Gdm functional 
groups. b, Representative pull-off force curve measured when retracting an 
alkyl-terminated AFM tip from a CjoH2;SH monolayer in 10 mM TEA at pH 7. 
c, Linear (top) and helical (bottom) representations of the globally amphiphilic 
(GA) B-peptides used in this study. d, Schematic illustration of the interaction 
of an alkyl-terminated AFM tip with an immobilized helical B-peptide with 
side chains colour-coded to match c. e, Representative pull-off force curve for 
an alkyl-terminated AFM tip retracting from a surface presenting GA-Lys 
immobilized on a monolayer displaying 0.1% hydroxyl-terminated and 
amine-terminated tetraethylene glycol (Methods) in 10 mM TEA at pH 7. 
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Figure 2 | Validation of measurement of hydrophobic interaction by 
addition of methanol. a, Adhesion force histograms for C}gH2,SH mono- 
layers measured at pH7 (red, in 10 mM TEA; black, in 60 vol% methanol). 

b, Influence of methanol added to 10 mM TEA (black squares, pH 7) and of pH 
(red circles, in TEA; red crosses, in 60 vol% methanol) on the mean adhesion 
forces calculated from histograms of C,9H2,;SH monolayers. Adhesion 

force histograms were obtained using an alkyl-terminated AFM tip and 
1,057-4,306 pull-off curves from 3-8 independent samples (Extended Data 
Tables 1 and 2). Data show mean + s.e.m. Lines are drawn to guide the eye. 


Data Fig. 1b) was not affected by addition of 60 vol% methanol, indica- 
ting that it has no hydrophobic component. pH and salt concentration 
changes both altered the attractive interaction strength (Extended Data 
Figs 4 and 5), indicating that adhesion is affected by a surface concen- 
tration ofammonium cations that increases with decreasing pH and also 
by the negative surface charge on the AFM tip (Methods). In contrast, 
force measurements with the AmC, ,H»»SH-C,)H2,SH mixed mono- 
layers revealed evidence of hydrophobic interactions: the pull-off force 
measured between the AFM tip and the mixed monolayer increased from 
2.9 + 0.1to4.1 + 0.2 nNas the pH of the aqueous TEA buffer decreased 
from 10.5 to 7 (Fig. 3a, b), and stayed at 2.1 + 0.4 nN independently of 
solution pH in the presence of 60 vol% methanol (Fig. 3a, b). Because 
the decrease in adhesion force on addition of methanol identifies a hy- 
drophobic contribution, these data reveal that protonation of the amine 
groups with decreasing pH doubles the strength of the hydrophobic 
interaction (from 0.80 + 0.2 to 1.9 + 0.2 nN) (Fig. 3c). 

We next replaced Am by guanidinium (Gdm) in the monolayers 
(Fig. 1a), and found that addition of 60 vol% methanol to aqueous 
TEA revealed no measurable hydrophobic interaction between pure 
GdmC,,H,.SH monolayers and the AFM tip for pH values between 
10.5 and 7 (Fig. 3d). This parallels the behaviour of the AmC, ,H,,SH 
monolayers (Extended Data Fig. 1b; see also Extended Data Fig. 1c and 
Methods for a discussion of the orientation-dependent interactions of 
Gdm). But in contrast to results with mixed AmC, ;H,,SH-C, 9H,;SH 
monolayers, the addition of methanol to TEA buffer (from pH 10.5 to 7) 
also had no effect on adhesive interactions between the AFM tip and 
the mixed GdmC, ,H22SH-C9H2SH surfaces (Fig. 3d, e). Proximally 
immobilized Gdm ions thus seem to eliminate measurable hydrophobic 
interactions between the non-polar domains of the mixed monolayer 
and the alkyl-terminated AFM tip (Fig. 3f; see Methods for additional 
discussion of effects of pH). 

If immobilized cationic groups can substantially alter hydrophobic 
interactions of neighbouring non-polar domains in an ion-specific man- 
ner, and given the prevalence of lysine and arginine residues in proteins 
(Am and, respectively, Gdm group in side chains), we proposed that 
modulation of local hydrophobic interactions by neighbouring charged 
groups may be important in peptidic systems. We therefore performed 
force measurements between an alkyl-terminated AFM tip and single 
B-peptide molecules presented from surfaces’ (Fig. 1d, e). We used B- 
peptides known to adopt a helical conformation that is globally amphi- 
philic, with non-polar side chains of trans-2-aminocyclohexanecarboxylic 
acid (ACHC) residues and cationic side chains of either B°*-homolysine 
(B°-hLys) or B’-homoarginine (f°-hArg) segregated onto opposite faces 
of the helix (Fig. 1c). This functional group segregation leads to a well- 
defined, ACHC-rich non-polar domain ~1 nm” in size and a patch of 
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Figure 3 | Influence of immobilized Am or Gdm on hydrophobic 
interactions at monolayer surfaces. a, pH dependence of mean adhesion 
forces measured using AmC,,H>,SH in either 10 mM TEA (red circles) or 
60 vol% methanol (blue crosses), and 40% AmC,,H22SH-60% Cj 9H2;SH in 
either 10 mM TEA (red triangles) or 60 vol% methanol (blue triangles). 
b, Histograms of adhesion forces measured using 40% AmC,;H22SH-60% 
Cj, H2;SH as a function of pH (red, in TEA; blue, in 60 vol% methanol). 
c, Hydrophobic contribution to the mean adhesion forces measured using 
AmC),H3.SH-C;9H2SH (blue crosses), AmC,,H22SH (blue triangles) or 
C,oH2SH (black circles) monolayers. d, Mean adhesion force measured using 
GdmC)H22SH in TEA (red circles) or 60 vol% methanol (green crosses), 
and 40% GdmC, ;H22SH-60% Cj, 9H2,SH in TEA (red crosses) or 60 vol% 
methanol (green circles). e, Histograms of adhesion forces measured using 40% 
GdmC,,H2,H-60% C)9H2;SH as a function of pH (red, in TEA; green, in 
60 vol% methanol). f, Hydrophobic contribution to the mean adhesion forces 
measured using GdmC,,H».SH-C,9H2,SH (green crosses), GdmC,;H».SH 
(green triangles) or Cj9H2SH (black circles). Adhesion force histograms were 
obtained using an alkyl-terminated AFM tip and 922-3,218 pull-off force 
curves from 3-6 independent samples (Extended Data Table 2). Data show 
mean + s.e.m. Lines are drawn to guide the eye. 


three cationic B*-residue side chains immobilized ~1 nm from the non- 
polar domain. Short B-peptides containing =50% ACHC exhibit very 
stable 14-helical conformations in aqueous solution as well as in aque- 
ous methanol'””®, thus allowing identification of hydrophobic interac- 
tions by the addition of methanol. Conventional peptides, comprised 
of «-amino acid residues, lack sufficient «-helix stability to support this 
experimental design”’. 

We immobilized B-peptides onto monolayers terminated in tetra- 
ethylene glycol at a low surface density to allow measurement of the 
interaction of single B-peptide molecules with an alkyl-terminated AFM 
tip’ (Methods and Extended Data Fig. 6). Adhesive force distributions 
obtained for the lysine-containing B-peptide (GA-Lys) in aqueous buf- 
fer were broad and changed with pH (Fig. 4a—d), but narrowed in the 
presence of 60 vol% methanol. The latter could be fitted to a single Gauss- 
ian, whereas two Gaussian functions—one identical to the Gaussian used 
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to fit the 60 vol% methanol data—were needed to provide a good fit to the 
data measured in aqueous buffer. We attribute the two sets of interactions 
undergone by GA-Lys in aqueous buffer to two modes of interaction 
between a B-peptide and the hydrophobic tip (Fig. 4a, inset cartoon), 
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Figure 4 | Influence of lysine and arginine side chains on hydrophobic 
interactions involving oligopeptides. a—h, Histograms of adhesion forces 
measured between alkyl-terminated AFM tips and immobilized B-peptides. 
The surfaces presented either GA-Lys (a-d) or GA-Arg (e-h), and the forces 
are reported as functions of pH in either 10 mM TEA (red) or 60 vol% methanol 
(blue, GA-Lys; green, GA-Arg). The inset in a shows schematic illustrations 
of the interaction between the AFM tip and GA-Lys. i, Dependence on pH 
of the hydrophobic (red squares, GA-Lys; red triangles, GA-Arg) and non- 
hydrophobic (blue circles, GA-Lys; green triangles, GA-Arg) components of 
the interaction of §-peptides with the AFM tip. Adhesion force histograms were 
obtained using 1,542-5,004 pull-off force curves from 4-7 independent 
samples (Extended Data Table 2). Data show mean = s.e.m. Lines are drawn to 
guide the eye. 
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with the deconvolution and force assignment supported by our past 
study’* and by control measurements showing that 60 vol% methanol 
does not change the direct interactions of Am groups with the AFM tip 
(Fig. 2a and Extended Data Fig. 1b). Because only the weaker adhesion 
mode persists in the presence of 60 vol% methanol, we attribute the 
stronger set of adhesive events in aqueous buffer to a hydrophobic in- 
teraction arising when the B-peptide is oriented such that it presents 
a non-polar surface (ACHC residues) to the tip. The weaker adhesion 
events are attributed to a B-peptide orientation that presents largely Am 
surface elements and leads to interactions that are not hydrophobic in 
nature. This interpretation predicts that charging of the lysine side-chain 
amino groups should strengthen hydrophobic interactions between the 
non-polar domain of GA-Lys and the AFM tip, as seen in our data 
(Fig. 4i). For a single GA-Lys molecule, the hydrophobic component of 
the measured adhesion force ranges from 0.18 + 0.02 to 0.29 + 0.02 nN 
(Fig. 41) and arises from a non-polar domain area of ~1 nm”, consist- 
ent with previous single-molecule studies'*”” and our monolayer mea- 
surements (Fig. 2) of adhesion forces in the range 0.80 + 0.2 to 1.9 + 
0.2 nN for contact areas of 10.7 nm* (Methods). 

To further explore the role of the cationic residues in modulating hy- 
drophobic adhesion involving the non-polar domains of the B-peptide 
helix, we replaced the B°-hLys residues with B’-hArg residues to gen- 
erate GA-Arg (Fig. 1c). In striking contrast to the behaviour of GA-Lys 
(Fig. 4a-d), adhesive interactions measured between GA-Arg and the 
AFM tip were largely unaffected by the addition of methanol (Fig. 4e-h). 
Although we do not interpret these measurements in the framework of 
multiple interaction modes between the B-peptide and the AFM tip” 
(Methods and Extended Data Fig. 7), they indicate that the adhesive 
interactions measured with GA-Arg in aqueous buffer are not of hy- 
drophobic origin. 

Our results, which are obtained using two independent systems in 
which specific cationic groups can be spatially juxtaposed with non-polar 
domains, the mixed monolayers and designed B-peptides, lead to a con- 
sistent set of conclusions regarding the effects of proximally immobi- 
lized charge on hydrophobic interactions. Specifically, both studies 
demonstrate that charging of immobilized Am groups strengthens hy- 
drophobic interactions involving proximal non-polar surfaces, whereas 
immobilized Gdm groups diminish hydrophobic interactions to the ex- 
tent that they are not measurable in our experiments. We note, however, 
that the spatial presentation of the non-polar and cationic groups to the 
AFM tip differs between the two systems, as is evident in the widths of 
the force distributions obtained using Am surfaces and GA-Lys (Me- 
thods and Extended Data Fig. 8). 

Past studies of the effects of soluble salts on solvation of molecular 
surfaces have been rationalized in terms of the surface accumulation or 
depletion of ions”, but the effects we have documented here differ: we 
have evaluated the impact of cationic groups covalently immobilized 
very close to non-polar moieties, a situation that often occurs in pro- 
teins and other complex biomolecules yet has not previously been ana- 
lysed systematically. We note that the distance between the cations and 
the non-polar domain in the globally amphiphilic B-peptides used in 
our study is fixed at ~1 nm, and that the impact of cationic groups on 
hydrophobic interactions thus extends over at least this distance. This 
conclusion is generally consistent with atomistic simulations and the- 
ories predicting that non-polar surfaces can influence the structure and 
compressibility of water over distances of ~1 nm (refs 8, 9), and with 
predictions that solvation shells are susceptible to the influence of neigh- 
bouring atoms, especially strongly charged ones”. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Materials. Tetraethylene glycol thiols terminated in hydroxyl (EG4) or amine 
groups (EG4N) along with 11-guanidinoundecanethiol (GdmC, ,SH) were purchased 
from Prochimia. 1-Decanethiol (C;9SH, 96%), 1-dodecanethiol (C,2SH, 98%), 
1-hexadecanethiol (C,¢SH, 95%), triethanolamine HCl (TEA, 99%) and methanol 
(anhydrous, 99.8%) were purchased from Sigma-Aldrich. Sulphosuccinimidyl-4- 
(N-maleimidomethyl)cyclohexane-1-carboxylate (SSMCC) was purchased from 
Pierce Biotechnology. 11-Aminoundecanethiol (AmC,,SH) was purchased from 
Dojindo Molecular Technologies. Ethanol (reagent, anhydrous, denatured) used for 
preparation of thiol solutions and sodium chloride (99.0%) were purchased from 
Sigma-Aldrich. Ethanol (anhydrous, 200 proof) used for rinsing was purchased from 
Decon Labs. De-ionized water used in this study had a resistivity of 18.2 MQ cm. 
All chemicals were used as received and without any further purification. The AFM 
tips used in this study (triangular in shape, nominal spring constant of 0.01 Nm * 
or 0.03 N m_') were purchased from Bruker. Silicon wafers were purchased from 
Silicon Sense. 

Preparation of f-peptide-decorated surfaces. 3-Peptide oligomers were synthe- 
sized by solid-phase methods as described elsewhere”’ and immobilized as detailed 
previously’*. In brief, we immobilized the B-peptides on mixed monolayers termi- 
nated in EG4 and EG4N groups, using a mole fraction of EG4N of 0.001 to achieve a 
low surface density of the B-peptides. This approach enabled us to measure adhesive 
interactions between single B-peptide molecules and the AFM tip; prior work has 
shown that EG4N-EG4 mixed monolayers do not generate measurable adhesive 
forces with hydrophobic AFM tips in aqueous TEA”. 

Preparation of pure or mixed-component monolayer surfaces. Monolayers of 
organothiols were prepared by immersing small pieces of gold-coated silicon wafers 
into ethanolic solutions containing 1 mM of the appropriate thiol components and 
incubating for 18h. On removal from solution, hydrophilic substrates were rinsed 
thoroughly with ethanol and water, dried with nitrogen and stored in TEA buffer 
(10 mM, pH 7) until AFM adhesion measurements were performed. Hydrophobic 
substrates were rinsed thoroughly with ethanol, dried with nitrogen and used imme- 
diately for AFM measurement. The mixed monolayers shown in Fig. 1a were prepared 
by co-adsorption of w-functionalized undecanethiols (AmC,,SH or GdmC,,SH at 
0.4 mM) and decanethiol (Cj)SH at 0.6 mM). 

Preparation of chemically functionalized AFM tips. Triangular-shaped cantilevers 
with nominal spring constants of 0.01 Nm! were used for experiments involving 
B-peptides, whereas cantilevers with nominal spring constants of 0.03 Nm’ were 
used for experiments involving monolayers unless otherwise specified. The spring 
constants of the cantilevers were calibrated using Sader’s method on a PCM-90 
Spring Constant Calibration Module (Novascan Technologies) and determined to 
be respectively ~0.01 N m7 | and ~0.03 Nm |. AFM tips were coated with a2 nm 
layer of titanium anda 20 nm layer of gold by physical deposition using an electron 
beam evaporator. Following the coating by gold, the tips were immersed in a 1 mM 
ethanolic solution of C;,SH and incubated overnight. On removal from solution, 
the tips were rinsed with ethanol, dried with a gentle stream of nitrogen and im- 
mediately transferred to the AFM fluid cell. 

Characterization of mixed monolayers. Ellipsometry was used to confirm forma- 
tion of monolayers of organothiols on gold films (Extended Data Fig. 3a), and X-ray 
photoelectron spectroscopy was used to determine the compositions of the mixed 
monolayers shown in Extended Data Fig. 3b. The results shown in Extended Data 
Fig. 3 reveal that the compositions of the mixed monolayers were similar to the 
compositions of the thiol solutions from which the mixed monolayers were formed 
(CyoSH mole fraction of 0.6). 

AFM force measurements. Adhesion force measurements were performed using a 
Nanoscope IIIa Multimode AFM equipped with a fluid cell (Veeco Metrology Group). 
Triangular-shaped silicon nitride cantilevers were used and functionalized as described 
above. Force measurements were performed at room temperature. Force curves were 
recorded using a constant contact time of 500 ms and retraction and approach speeds 
of 1,000 nm s_'. All measurements were performed in 10 mM aqueous TEA solutions 
unless otherwise specified. 

Interpretation of adhesion measured between alkyl-terminated surfaces. We mea- 
sured advancing contact angles of water (0,4) on monolayers formed from CgH2;SH 
and C2H>3SH to be 108 + 3° and 107 + 2°, respectively, consistent with formation of 
hydrophobic surfaces. We also confirmed that the addition of TEA to water does not 
measurably influence the water-mediated adhesion between the two hydrophobic 
surfaces (Extended Data Fig. 2). We used the adhesion measurements shown in 
Extended Data Fig. 2 in combination with knowledge of the surface tension of water 
(Vtiquid—vapour = 72.4 mJ m * (ref. 18)) to estimate (i) the work of adhesion (W4q) 
between the alkyl-terminated AFM tip and an alkyl-terminated monolayer in water, 
and (ii) the apparent radius of curvature of the AFM tip in these measurements. Wag 
in water is related to the interfacial free energies (y) as*° 
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Wad = Ysample—water + tip —water — Ysample—tip 


For identical surface functional groups (alkyl-terminated surfaces) on both the sample 
and the AFM tip, ample-water ~ Vtip-water 2Nd Ysample-tip ~~ 0, and W,q thus becomes the 
work of cohesion”® (W,,): 


Waa Woo 2) 27 tip — water (1) 


) = 
sample — water 


The interfacial energies between the surface and water in equation (1) can be obtained 
from contact angle measurements and use of Young’s equation’*: 


Y surface — water = Ysurface — vapour —~ Ywater—vapourCOS( Daa) (2) 


By combining equations (1) and (2), and using /surface-vapour ~ 20 mJ m ” (ref. 27), 
we estimate W,q between the alkyl-terminated surface and AFM tip to be 85 mJ m 
This value is consistent with previously reported values of W,q for hydrophobic 
surfaces, which range from 85 to 100 mJ m ” (ref. 28). From knowledge of Wag and 
the adhesion force (F,q), the Johnson—Kendall—Roberts model can be used to deter- 


mine the effective AFM tip radius!* (Ryxr): 
3 
Fug = 3 Tie Waa (3) 


From equation (3) (with F,g = 4.0 nN and W,q = 85 mJ m 7; see above), we estimate 
Ryxg to be 10.7 nm, an effective radius similar to that inferred from past measure- 
ments of F,q using gold-coated AFM tips covered with monolayers of alkanethiols””?. 
We note that this effective radius is smaller than the AFM tip radius determined with 
scanning electron microscopy (Rsem = 53 + 5 nm). As discussed in detail elsewhere, 
this difference arises from the presence of gold asperities, resulting in adhesive inter- 
actions between functionalized gold domains that are smaller than Rspm (refs 30, 31). 
In the main text, we report that the addition of 60 vol% methanol to the aqueous 
solution results in an adhesive force between alkyl-terminated surfaces of 2.1 + 0.2 nN 
(Fig. 2b). In solutions containing 60 vol% methanol, we measured the AFM tip to jump 
into contact with the alkyl-terminated surface at a distance (Dj) of 17 + 2nm. We 
estimate the Hamaker constant (A) corresponding to the jump-to-contact dis- 

tance as'® 
kD 


jtc 
= 4 
3RsEm (4) 


where k = 0.01 Nm is the AFM cantilever spring constant. From equation (4), we 
calculate A to be 30 X 10° *°J, a value consistent with past reports of Hamaker con- 
stants for van der Waals interactions between gold surfaces (10 X 10°’ to 40 X 10° 7°J; 
ref. 32). Past studies have also reported that interactions involving self-assembled 
monolayers formed on gold films are influenced by van der Waals forces from the 
underlying gold films'*****. Overall, our measurements of the interactions between 
methyl-terminated surfaces in 60 vol% methanol are consistent with a dominating 
influence of van der Waals forces. We note that our experimental measurements in 
60 vol% methanol suggest that the adhesive force of 2.1 + 0.2 nN comprises principally 
van der Waals force (1.8 + 0.2 nN) with a small (0.3 + 0.2 nN) residual hydrophobic 
contribution (Fig. 2b). 

Adhesive interactions involving ammonium-terminated surfaces (AmC,,H>2SH). 
Figure 3a and Extended Data Fig. 1b show that adhesive interactions between 
AmC,,H>.SH monolayers and alkyl-terminated AFM tips increase in strength with 
decreasing pH. Here we discuss the origins of this pH dependence of the adhesive 
interaction. In this context, we note that past experimental studies have documen- 
ted that hydrophobic surfaces acquire an excess of negative surface charge density 
in aqueous solution**~*’. Consistent with these past studies, our electrophoretic 
mobility measurements of C;2H23SH-functionalized gold nanoparticles (diameter 
of 150 nm) confirmed the presence of negative C-potentials in aqueous TEA (—26 + 
9mV at pH7 and —38 + 9mV at pH9). Our adhesion force measurements are 
consistent with the excess negative surface charge of the hydrophobic AFM tip me- 
diating a screened electrostatic attraction to the positively charged AmC,,H».SH 
monolayers’’”*. Specifically, we measured the presence of a long-range attraction 
between the approaching AFM tip and the AmC,,SH surface that caused the AFM 
tip to jump into contact with the AmC,,SH surface. The distance at which the tip 
jumped depended on the pH and ionic strength (Extended Data Fig. 4). An increase 
in ionic strength was observed to reduce the jump-to-contact distance (from 7 nm 
in TEA at pH7 to 3nm in TEA at pH7 with 100 mM NaCl), consistent with an 
increase in the screening of the interaction with increasing ionic strength. We eval- 
uated the Debye length (Ap) as”® 

(5) 


Ap= 


where ¢, is the dielectric constant, 9 is the permittivity of free space, kg is the Boltz- 
mann constant, T is the temperature, ¢ is the electron charge, z; is the charge of the ith 
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ion and M, is the molar concentration of the ith ion. From equation (5), we estimate Ap 
to be 4.31 nm in TEA and 0.94 nm in TEA with 100 mM NaCl. These estimates of 7p 
are generally consistent with the distances at which the jump to contact was measured, 
thus providing support for our conclusion that the long-range attraction is a con- 
sequence of the interaction of two electrical double layers. Additionally, in the absence 
of added salt, we observed the magnitude of the long-range attractive force to decrease 
with increasing pH, consistent with the effects of a decrease in the density of charge on 
the amine-terminated surface (Extended Data Fig. 4). 

In Fig. 3c, we show that the mean hydrophobic force measured between the AFM 
tip and the 40% AmC,,H2,SH-60% C,9H3,;SH monolayer at pH 7 is similar to the 
mean hydrophobic force measured between the AFM tip and the monolayer of 
CioH2:SH. For other mixed-monolayer compositions, however, we found the adhe- 
sion force generated by the mixed monolayers at pH 7 to differ from the adhesion 
forces measured using C;9H2;SH. For example, as shown in Extended Data Fig. 5, a 
monolayer of 90% AmC,,H>.SH-10% C,9H2,SH at pH7 does not generate the 
same magnitude of hydrophobic adhesion force as the CjpH2:SH monolayer. 
Adhesive interactions involving Gdm-terminated surfaces. In Fig. 3d and Ex- 
tended Data Fig. 1c, we report the pH dependence of the adhesive interaction between 
GdmC,,H2,SH surfaces and the hydrophobic AFM tip. Whereas the strength of the 
adhesive interaction did not significantly change between pH 10.5 and 8, on decreas- 
ing the pH from 8 to 7 a measurable increase in adhesive strength was recorded. We 
note that the interaction of the GdmC, HSH surface with the hydrophobic AFM 
tip is more complex than the interaction of the AmC,,H,,SH surface with the tip 
because, in addition to interacting with electrical double layers, Gdm binds to non- 
polar domains through orientation-dependent (see below) van der Waals interac- 
tions”’. The pH-dependent trends seen in Fig. 3d are also evident in the measurements 
shown in Fig. 4i using GA-Arg. 

Inspection of Fig. 3d, f shows that GdmC,,H22SH-C}9H2:SH monolayers interact 

with the hydrophobic AFM tip more strongly than do GdmC,,H2.SH monolayers. 
Past studies have concluded that Gdm preferentially interacts with non-polar domains 
ina planar orientation”. We speculate that the difference in the strength of the adhesion 
between alkyl surfaces and either GdmC,,;H2.SH or GdmC,;H22.SH-C;9H2;SH sur- 
faces stems from differences in the orientation of the Gdm groups presented by the two 
monolayers. For example, it is possible that within the GdmC, ,H22SH monolayers, the 
Gdm functional groups are densely packed such that they can interact with the hydro- 
phobic surface only in a side-on orientation. In contrast, within the GdmC,;H2.SH- 
CioH2,SH surfaces, the Gdm functional groups protrude above the surface, resulting in 
a greater number of orientational degrees of freedom, which would allow face-on 
interactions that would generate larger adhesion forces. 
Interactions of single B-peptides with the AFM tip. The density of surface-immobilized 
B-peptide molecules used in our study was selected by performing screening mea- 
surements at different immobilization densities. The density of surface-immobilized 
B-peptide molecules was controlled by using mixed monolayers terminated in EG4 
and EGAN groups (the mole fraction of EG4N in the mixed monolayer was varied to 
change the immobilization density of the B-peptides). For ease of interpretation of 
our measurements, guided by our past results'*, isoGA-Lys (Extended Data Fig. 6) 
was used in these screening measurements. We measured the mean adhesion force 
to increase with mole fraction of EG4N in the mixed monolayer used to immobilize the 
B-peptides when the mole fraction of EG4N was greater than 0.002 (0.14 + 0.015 nN 
at a mole fraction of 0.002, 0.7 + 0.05 nN at 0.01 and 1.1 + 0.08 nN at 0.1), indicating 
that at immobilization densities above 0.002 the adhesive force reflects the simultan- 
eous interaction of multiple B-peptides with the AFM tip. However, when the mole 
fraction of EG4N in the mixed monolayer used to immobilize the B-peptides was 
below 0.002, the adhesion force ceased to change with decreasing immobilization 
density (0.14 + 0.015 nN at 0.002, 0.13 + 0.013 nN at 0.001), consistent with an adhes- 
ive pull-off force that reflects the interaction of a single B-peptide with the AFM tip. 
In the measurements reported in this paper, we used an immobilization density of 
B-peptides of 0.001 to obtain measurements of the interactions of single B-peptide 
molecules with the AFM tip. 

To enable a comparison of the magnitude of the hydrophobic adhesion forces 
measured using single B-peptides with the magnitude of the hydrophobic adhesion 
forces measured using the alkyl-terminated monolayers, we used Johnson-Kendall- 
Roberts theory to estimate the contact radius (a) at pull-off for the measurements 
using the monolayers’*”*: 


30Ricp Wad 
= 5K (6) 


where K is the elastic modulus of the contacting surfaces (K ~ 6.4 GPa; refs 39, 40). 
By using equation (6) and the values of Rjxp and Wag reported above, we estimate the 
contact area at pull-off to be 10.7 nm7. In contrast, the size of the non-polar domain 
of the B-peptide is ~1 nm? (refs 41, 42), which is approximately ten times smaller 
than the contact area involved in the monolayer adhesion measurements. Consistent 
with this conclusion, we measured the magnitudes of the hydrophobic forces in the 


a 


monolayer experiments to lie between 0.80 and 1.9nN (Fig. 3c), whereas in the 
B-peptide experiments they ranged from 0.16 to 0.28 nN (Fig. 4i). 

Comparing the widths of the adhesion force histograms. As noted in the main text, 
differences in the spatial presentation of the non-polar and cationic groups between 
the B-peptide and the monolayer systems are evident in the force histograms obtained 
using GA-Lys or mixed monolayers of AMC; ;H22SH-C)oH2)SH. Specifically, immo- 
bilized GA-Lys molecules possess rotational degrees of freedom that permit the non- 
polar AFM tip to interact with two distinct helical faces, that is, surfaces consisting 
entirely of non-polar ACHC residues or containing largely cationic residues (Fig. 4a, 
cartoon). This leads to two modes of interaction when measurements are performed 
in aqueous buffer (as described in the main text). However, when the measurements 
are performed in the presence of methanol, there is only one dominant mode of 
interaction with the AFM tip, which is reflected in a narrowing of the force histograms 
(from a coefficient of variation of 0.32 in aqueous buffer to 0.22 in the presence of 
methanol; see Extended Data Fig. 8). In contrast, the contact area between the AFM 
tip and a mixed monolayer of AMC, ,;Hj»SH-C,9H2,SH contains a statistical mixture 
of approximately 50 charged and non-polar groups in total, and there is thus little 
change in the coefficient of variation of the force histogram on addition of methanol 
(see Extended Data Fig. 8 and below). In contrast to observations with GA-Lys, addi- 
tion of methanol during measurements involving GA-Arg did not lead to a change in 
the apparent number of modes of interaction of GA-Arg with the AFM tip, and the 
coefficients of variation of the histograms obtained with GA-Arg are thus similar with 
and without methanol (Extended Data Fig. 8). 

To provide insight into the above-described coefficients of variation obtained from 
our experiments, we modelled the statistical variation of the composition of the mixed 
monolayers within the area of contact made with the AFM tip by using a binomial 
distribution. The mean (j1) and variance (a7) of the number of either cationic or alkyl 
groups within the contact area with the AFM tip can be calculated from the total 
number of functional groups (1) and the probability (p) of finding either a cationic or 
alkyl functional group as 


w=np 


Within the contact area between the AFM tip and a monolayer, we estimate the 
total number of groups to be n ~ 50 by assuming a thiol packing density of 0.214 nm? 
per thiol*? and a contact area at pull-off of 10.7 nm? (see above). We calculate the 
coefficient of variation for the methyl group within the mixed monolayer to be 0.12 
(for n = 50, P = 0.6) and the coefficient of variation for the cationic group within the 
mixed monolayer to be 0.17 (for n = 50, P = 0.4). The coefficients of variation mea- 
sured for the mixed monolayers (0.08 to 0.11) are comparable to the coefficients 
of variation calculated for a binomial distribution of cationic and non-polar units, 
consistent with a molecularly mixed monolayer of cationic and alkyl groups. 
Influence of the buffer anion on hydrophobic adhesion. The measurements reported 
in Figs 1-4 were obtained using 10 mM TEA HClas the buffer. We confirmed that a 
change in buffer did not influence the measurements of hydrophobic forces reported 
in this paper. Specifically, as detailed in Extended Data Fig. 7, we determined that 
measurements performed using 10 mM MOPS as the buffer were indistinguishable 
from measurements performed using TEA HCl. 
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Extended Data Figure 1 | Influence of pH and addition of methanol reported as a function of pH (red, in TEA; blue, in 60 vol% methanol). In TEA: 
(60 vol%) on adhesive interactions between self-assembled monolayers and =n = 1,309, N= 5 at pH 7; n = 1,797, N= 4 at pH8, n = 1,605, N= 4 at pH9; 
alkyl-terminated AFM tips. a, Adhesion force histograms for CjgH2,SH n= 1,009, N = 4 at pH 10.5. In 60 vol% methanol: n = 1,772, N = 3 at pH7; 
monolayers interacting with an alkyl-terminated AFM tip, measured as a n= 1,614, N = 5at pH 8;n = 1,603, N= 5 atpH9;n = 1,151,N=4atpH10.5. 
function of pH (red, in TEA; black, in 60 vol% methanol). n = 3,002 (number of —_c, Histograms of adhesion forces measured between an alkyl-terminated AFM 
test events), N = 3 (number of independent samples) (TEA pH 7); n = 3,084, tip and monolayers of GdmC,,H).SH, measured as a function of pH (red, in 
N=4 (TEA pH8); n = 1,076, N= 3 (TEA pH9); n = 1,288, N= 6 (TEA ; i 
pH 10.5); 2 = 3,812, N = 4 (60-vol% MeOH pH 7); 


TEA; green, in 60 vol% methanol). In TEA: n = 1,693, N = 4 at pH 7;n = 1,164 
n = 4,306, N = 8 (60 vol% N= 4 at pH8; n = 2,002, N= 4 at pH9; n = 1,249, N= 3 at pH 10.5. In 
MeOH pH 8); n = 1,057, N = 4 (60 vol% MeOH pH 9); n = 1,093, N= 6 


3093, N= 60 vol% methanol: n = 1,907, N= 4 at pH 7; n = 1,211, N= 4 at pH8; 
(60 vol% MeOH pH 10.5). b, Histograms of adhesion forces measured between 1 = 2,618, N= 5 at pH9; n = 1,178, N= 3 at pH 10.5. The histograms show 
an alkyl-terminated AFM tip and monolayers formed from AmC,,;H>2SH data obtained from all pull-off force curves from all samples 
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Extended Data Figure 2 | Comparison of adhesive interactions measured alkyl-terminated AFM tip under different solution conditions (red, in TEA at 
between hydrophobic surfaces in pure water and in aqueous TEA. pH7, n = 3,002, N = 3; blue, in water, n = 4,770, N = 6). The histograms show 
Histograms of adhesion forces for CjgH2,SH monolayers interacting with an _—_ data obtained from all pull-off force curves from all samples. 
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Extended Data Figure 3 | Characterization of the composition of mixed 
monolayers. a, Ellipsometric thicknesses of monolayers used in this study 

(n = 3, N= 3). b, Ratio of nitrogen to sulphur signal, obtained by X-ray 
photoelectron spectroscopy (n = 3, N = 3), for mixed monolayers, plotted as a 
function of the mole fraction of the Am- or Gdm-terminated alkanethiol in the 
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the error bars show the s.d. of three independent samples. c-h, Nitrogen (blue) 
and sulphur (red) signals obtained by X-ray photoelectron spectroscopy for 
mixed monolayers formed on the surfaces of gold films: GdmC,,H22SH- 
CoH2:SH (c), AmC,,H22SH-C9H2,;SH (d), bare gold (e), GdmC,,H22SH 
(f), AMC, HSH (g) and CjoH2:SH (h). 
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Extended Data Figure 4 | Influence of pH and ionic strength on thedistance AFM tips interacting with AmC,,H».SH monolayers, as measured using the 
dependence of the interaction of a hydrophobic AFM tip and an Am- indicated aqueous solution conditions. 
terminated monolayer (on approach). Approach curves for alkyl-terminated 
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Extended Data Figure 5 | Influence of pH and addition of methanol n= 1,480, N= 4 at pH9; n = 1,730, N= 4 at pH 10.5) or 60 vol% methanol 
(60 vol%) on adhesive interaction between an alkyl-terminated AFMtip and (blue triangles: n = 972, N= 4 at pH7; n = 1,548, N= 4 at p 8; n = 1,294, 
monolayers containing 90% AmC,,;H2,SH-10% C,9H2;SH. a, pH N= 4 at pH9; n = 1,176, N= 4 at pH 10.5). b, Hydrophobic contribution to 
dependence of mean adhesion force measured between an alkyl-terminated the mean adhesion forces measured using 90% AmC,,H2.SH-10% C,9H2;SH 


AFM tip and Am-containing monolayers: 90% AmC,,H».SH-10% C;9H2\SH (red triangles), AmC,,;H2.SH (blue triangles) or C,9H2,SH (black circles) 
in either TEA (red triangles: n = 1,344, N = 4at pH7;n = 1,326, N=5atpH8; monolayers. Data show mean + s.e.m. 
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Extended Data Figure 6 | Non-globally amphiphilic B-peptide. Linear and helical representations of the non-globally amphiphilic B-peptide isoGA-Lys. 
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Extended Data Figure 7 | Influence of dissolved anions (MOPS versus CI") 
on hydrophobic interaction. a, The chemical structure of MOPS. 
b, Hydrophobic contribution to the mean adhesion force measured using 
monolayers of AmC;;H22SH-Cj9H2:SH (red crosses, using MOPS-60 vol% 
methanol; blue crosses, using TEA-60 vol% methanol), AmC,,;H>.SH (red 
triangles, using MOPS-60 vol% methanol; blue triangles, using TEA—60 vol% 
methanol) or C;9H2)SH (red circles, using MOPS-60 vol% methanol; black 
circles, using TEA-60 vol% methanol). c, Hydrophobic contribution to the 
mean adhesion force measured using monolayers of GdmC,,H2.SH- 
C,9H2)SH (red crosses, using MOPS-60 vol% methanol; green crosses, using 
TEA-60 vol% methanol), GdmC,,H>,SH (red triangles, using MOPS-60 vol% 
methanol; green triangles, using TEA-60 vol% methanol) or C;gH2)SH (red 
circles, using MOPS-60 vol% methanol; black circles, using TEA-60 vol% 
methanol). On Cj gH2;SH surface: n = 1,702, N= 4 (MOPS pH 7); n= 1,014, 
N=4 (MOPS pH 8);n = 1,006, N = 3 (MOPS pH 9); n = 1,008, N = 4 (MOPS 
pH 10.5); n = 1,002, N = 4 (MOPS-60 vol% MeOH pH 7); n = 1,000, N= 3 
(MOPS-60 vol% MeOH pH 8); n = 1,009, N = 3 (MOPS-60 vol% MeOH 
pH 9); n= 1,001, N= 4 (MOPS-60 vol% MeOH pH 10.5). On AmC, ,H>,SH- 
C,oH2)SH surface: n = 1,100, N= 3 (MOPS pH 7); n = 1,201, N= 4 (MOPS 
pH 8); n = 989, N= 4 (MOPS pH 9); n = 998, N= 4 (MOPS pH 10.5); 


n= 1,122, N= 4 (MOPS-60 vol% MeOH pH 7); n = 997, N = 3 (MOPS- 

60 vol% MeOH pH 8); n = 1,126, N= 4 (MOPS-60 vol% MeOH pH 9); 

n= 1,328, N= 3 (MOPS-60vol% MeOH pH 10.5). On GdmC, HSH 
surface: n = 1,000, N= 3 (MOPS pH 7); n = 1,001, N= 3 (MOPS pH 8); 

n = 1,002, N= 3 (MOPS pH 9); n = 1,001, N = 3 (MOPS pH 10.5); n = 1,003, 
N= 3 (MOPS-60 vol% MeOH pH 7); n = 1,000, N = 3 (MOPS-60 vol% 
MeOH pH 8); n= 1,001, N= 3 (MOPS-60 vol% MeOH pH 9); n = 1,005, 
N= 3 (MOPS-60 vol% MeOH pH 10.5). On GdmC,;H22SH-C)9H2:SH 
surface: n = 999, N= 3 (MOPS pH 7); n = 1,001, N= 3 (MOPS pH 8); 

n= 1,000, N= 3 (MOPS pH 9); n = 999, N= 3 (MOPS pH 10.5); n = 1,002, 
N= 3 (MOPS-60 vol% MeOH pH 7); n = 999, N = 3 (MOPS-60 vol% MeOH 
pH8); n= 1,001, N= 3 (MOPS-60 vol% MeOH pH 9); n = 1,002, N= 3 
(MOPS-60 vol% MeOH pH 10.5). On AmC,H22SH surface: n = 1,004, N = 3 
(MOPS pH 7); n = 1,002, N = 3 (MOPS pH 8); n = 1,002, N = 3 (MOPS pH 9); 
n= 1,002, N= 3 (MOPS pH 10.5); n = 1,001, N= 3 (MOPS 60 vol% MeOH 
pH7); n= 1,001, N = 3 (MOPS-60 vol% MeOH pH 8); n = 1,001, N= 3 
(MOPS-60 vol% MeOH pH 9); n = 1,001, N = 4 (MOPS-60 vol% MeOH 

pH 10.5). Measurements were conducted as described in Methods. Data show 
mean + s.e.m. Lines are drawn to guide the eye. 
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Extended Data Figure 8 | Characterization of the widths of adhesion force pH 9; blue, in 60 vol% methanol). Measurements were conducted as detailed 
histograms. The coefficient of variation was calculated from histograms of in Methods. 
the adhesion forces measured using the indicated surfaces (red, in TEA at 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 1 | Statistical information for data shown in Fig. 2b related to the influence of methanol 


vor Methanol TM Nomis /n independent sarples 
0% 3,002 3 
20% 2,128 4 
40% 1,034 3 
60% 3,810 4 
80% 1,053 3 
100% 1,004 3 
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Extended Data Table 2 | Statistical information for data shown in Fig. 2b (pH-dependent data) and Figs 3 and 4 


Figure 


2b 


3a 


3a 


3d 


3d 


4a-d 


4e-h 


The number of test events / The number of samples = (n/N) 


Monolayer TEA TEA TEA TEA 60% MeOH 60% MeOH 60% MeOH 60% MeOH 
pH7 pH8 pH9 pH10.5 pH7 pH8 pH9 pH10.5 
CigH»,SH 3,002/ 3 3,084/ 4 1,076/3 1,288 /6 3,812/ 4 4,306/ 8 1,057/ 4 1,093/ 6 


AmC,,H»oSH 130975  1,797/4 11,6054 1,00974 11,7773 16145 1,6035 1,151/4 
40% AmGyHeSH/ 4799/4 2 22 1,626/4 1,569/4 2982/7 1,669/4 
Aft a ct 709/ 800/ 3 0085/5  922/3 626/ 569/ 982/ 669/ 

GdmC,,H»,SH 1693/4 11644 20024 11,2493 1907/4 1,211/4 26185  1,178/3 
40% GdMC1;HeeSH/ 34396 2997/5 3.2186 2949/6  3,030/6 3,075 3018/6 2,912/6 
60% C,gH>,SH 


GA-Lys 3,175/ 4 1,957/ 4 1,606/ 4 1,654/ 4 3,666/ 4 3,268/ 4 1,542/ 4 1,867/ 4 


GA-Arg 3,676/ 6 4,221/7 3,573/ 6 2,580/ 5 2,191/6 3,785/ 6 5,004/ 5 2,582/ 5 
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Non-stabilized nucleophiles in Cu-catalysed 
dynamic kinetic asymmetric allylic alkylation 


Hengzhi You!, Emeline Rideau’, Mireia Sidera! & Stephen P. Fletcher! 


The development of new reactions forming asymmetric carbon- 
carbon bonds has enabled chemists to synthesize a broad range of 
important carbon-containing molecules, including pharmaceutical 
agents, fragrances and polymers’. Most strategies to obtain enanti- 
omerically enriched molecules rely on either generating new stereo- 
genic centres from prochiral substrates or resolving racemic mixtures 
of enantiomers. An alternative strategy—dynamic kinetic asymmetric 
transformation—involves the transformation of a racemic starting 
material into a single enantiomer product, with greater than 50 per 
cent maximum yield”’. The use of stabilized nucleophiles (pK, < 25, 
where K, is the acid dissociation constant) in palladium-catalysed 
asymmetric allylic alkylation reactions has proved to be extremely 
versatile in these processes**. Conversely, the use of non-stabilized 
nucleophiles in such reactions is difficult and remains a key challenge*? 

Here we report a copper-catalysed dynamic kinetic asymmetric trans- 
formation using racemic substrates and alkyl nucleophiles. These 
nucleophiles have a pK, of =50, more than 25 orders of magnitude 
more basic than the nucleophiles that are typically used in such trans- 
formations. Organometallic reagents are generated in situ from alkenes 
by hydrometallation and give highly enantioenriched products under 
mild reaction conditions. The method is used to synthesize natural 
products that possess activity against tuberculosis and leprosy, and 
an inhibitor of para-aminobenzoate biosynthesis. Mechanistic studies 
indicate that the reaction proceeds through a rapidly isomerizing 
intermediate. We anticipate that this approach will bea valuable com- 
plement to existing asymmetric catalytic methods. 

Two strategies for generating single enantiomer compounds using 
enantioselective catalysis have become widely used: the first strategy gen- 
erates a chiral product bya selective reaction which introduces asymmetry 
to a prochiral substrate (Fig. 1a). This prochiral approach has proven 
important in the development of catalytic asymmetric reactions, despite 
the rather limited availability of prochiral substrates when compared 
to chiral substrates. The second widely used strategy is to start from a 
racemic mixture of chiral starting materials. Here the catalyst selec- 
tively reacts with one of the two enantiomers allowing differentiation 
(or resolution) of the enantiomers, but the yield is necessarily limited 
to 50%, as the undesired enantiomer remains as starting material. An 
efficient variation of this second strategy is to couple enantiomer differ- 
entiation with interconversion of the enantiomers, commonly called 
dynamic kinetic resolution. This strategy allows yields of more than 50% 
from chiral starting materials** 

Transition-metal-catalysed asymmetric allylic alkylation (AAA) reac- 
tions have proven to be powerful tools for creating new carbon-carbon 
bonds*?""'. Twenty years ago, palladium-catalysed AAA reactions were 
reported in which racemic mixtures were converted to single enantio- 
mer products with high yield and enantioselectivity (Fig. 1b)'’. The use 
of stabilized nucleophiles (pK, < 25) in metal-catalysed AAA reactions 
is now well established in catalysis and synthesis*’. These reactions are 
dynamic kinetic asymmetric transformations (DYKATs) as they form 
one enantiomer of a new product from both enantiomers of a racemic 
starting material, and no ‘resolved’ starting material is recovered*”. Mech- 
anistic studies reveal that Pd-catalysed DYKATs operate through a 


pathway where both enantiomers of the starting material are converted 
to a common pseudo-prochiral intermediate, and a subsequent enan- 
tioselective step creates the new stereogenic centre®. The use of non- 
stabilized nucleophiles in such reactions is a long-standing research goal*° 
and efforts to broaden the scope of DYKAT to non-stabilized carbon 
nucleophiles now allows certain specific ‘less-stabilized’ partners to be 
used in Pd-catalysed asymmetric (for example, Fig. 1c)”!’ and non- 
enantioselective procedures’. 

One difficulty in expanding the scope of DYKATs is that metal- 
catalysed AAAs do not rely on a single mechanistic pathway’. In par- 
ticular, there is a significant mechanistic distinction between stabilized 
and non-stabilized nucleophiles (Fig. 1d)**”"°. In the case of stabilized 
nucleophiles, the key bond-making event occurs outside of the coor- 
dination sphere of the metal, so that the stereochemistry is determined 
when the nucleophile attacks a carbon atom of a m-allyl-Pd intermedi- 

e’*, In contrast, when using non-stabilized nucleophiles bond forma- 
tion probably occurs through reductive elimination of an intermediate 
in which the nucleophile is bound to the metal centre”. 

Catalytic asymmetric reactions with non-stabilized nucleophiles often 
use copper catalysts'*’”. These reactions have proven much more dif- 
ficult to study mechanistically than their Pd-catalysed counterparts, as 
non-stabilized organometallic reagents are typically very reactive, hin- 
dering the isolation of intermediates and characterization of pathways. 
Copper-catalysed allylic alkylations are generally accepted to operate via 
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Figure 1 | Asymmetric allylic alkylation (AAA) procedures. a, Typical 
examples of the prochiral approach to asymmetric catalysis, in which a species 
selectively adds to one face of a prochiral substrate, exemplified (on the left) 
by asymmetric conjugate addition and (on the right) by AAA. Nu, nucleophile; 
R, generic substituent (or generic group); X, leaving group. b, A prototypical 
Pd-catalysed DYKAT, where AAA with a stabilized carbon nucleophile 
transforms a racemic mixture of starting materials into a single highly 
enantioenriched product. c, An example of DKYAT in AAA using heterocyclic 
nucleophiles with a pK, > 25. d, Transition-metal-catalysed allylic substitution 
mechanisms with stabilized and non-stabilized nucleophiles. 
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regiospecific anti-addition mechanisms, where selective rate-determining 
Sn2' oxidative additions give copper(II) intermediates, and fast reduc- 
tive eliminations form the final product'*””. A key distinction between 
Cu- and Pd-AAA reactions is a lack of an efficient G- to m- isomeriza- 
tion’® in the case of copper, so that if reductive elimination is fast, a 
m-allyl-Cu species will not form and reactions involving racemic start- 
ing materials will lead to racemic products. 

Copper-catalysed AAA on prochiral materials, where selective oxid- 
ative addition occurs on one enantiotopic face of the substrate, can be 
readily achieved'*'*"'*, However, the generally accepted Cu-catalysed 
mechanisms would seem to preclude efficient asymmetric reactions with 
chiral racemic substrates, where addition would be controlled by the 
stereogenic centre of the substrate, not the catalyst’. We note that race- 
mization of enantiopure starting materials during Cu-catalysed allylic 
alkylation has been reported, suggesting the possibility of generating 
useful 1-allyl-Cu species and DYKATs”’. Building on this work, promis- 
ing initial results have been obtained with Grignard reagents”, but 
despite extensive examination, and the elucidation of interesting mech- 
anistic pathways, generally useful procedures have not emerged”!”’. 

Based on the observation that alkylzirconium reagents undergo 
highly enantioselective conjugate addition at ambient temperature”**, 
whereas procedures with traditional non-stabilized carbon nucleophiles 
(based on Mg, Al, and Zn species) are normally performed at very low 
temperatures'*’’”, we hypothesized that alkylzirconium reagents may 
allow efficient AAA from racemic substrates. Here we describe AAA of 
racemic cyclic allylchlorides with alkylzirconocenes initiated by copper 
catalysts. Highly enantioenriched products are obtained at room tem- 
perature in convenient procedures, in which all of the reaction com- 
ponents are commercially available. 

We began by exploring the coupling of racemic 3-substituted cyclo- 
hexenes 1 and the alkylzirconium species generated in situ from 4-phenyl- 
1-butene (2 in Table 1) and the Schwartz reagent (Cp2ZrHCl) under a 


Table 1 | Selected optimization experiments 


o-oo 


OP »ZrHCL CHLClys CH,Cl,; 


——~“Soivent 
10 mol% copper 
10 mol% ligand 


é 3a 


Bu 
oO ZA OCH, 
COIS COND OY 
oO ‘Bu P =N 
“Os OO OC” 
cite OO 
Re OE 
Buc NS B c 
OCH, 
Entry Leaving group Ligand Copper source Solvent Temp.  e.e.(%) 
CC) 
1 Br (1a) A (tBuCN)2CusOTf CH2Clo 25 4 
2 OP(O)(OEt)2(1b) A (tBUCN)sCusOTf CH2Cl, 25 10 
3 OoCCF3 (1¢) A (tBuCN)2CusOTf CH2Clo 25 10 
4 Cl (1d) A (tBUCN)2CueOTf ~CH2Clo = 25 35 
5 Cc B (tBuCN)2CueOTf CH2Clo 25 i 
6 Cc Cc (tBuCN)2CusOTf CH2Clo 25 71 
7 C Cc CuCl/AgSbFg  CH2Clo 25 51 
8 Cc Cc CuCl CH2Clo = =25 49 
9 Cc Cc CuBr CH2Clo = =25 73 
10 C Cc Cul CH2Clo =25 89 
11 Cc Cc Cul CHCl3 25 93 
12 Cc Cc Cul CHCl3 0) 95 
13 C Cc Cul CHCl3 60 77 
14 Br (1a) Cc Cul CHCl3 25 77 
Optimization was performed for the AAA of racemic materials bearing different leaving groups and 
4-phenyl-1-butene, which was hydrometallated using the Schwartz reagent. Different copper sources, 
ligands, solvents and temperatures were examined. The enantiomeric excess (e.e.) was determined by 


HPLC. Full conversion was observed in all cases shown. 
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variety of reaction conditions. The use of Pd- and Ir-based catalysts did 
not lead to product 3a, but when Cu-catalysts were used, product 3a 
was obtained with varying degrees of enantioselectivity. The combination 
of 3-chloro-cyclohexene 1d, phosphoramidite C"', and ((BuCN),Cu: OTf 
in dichloromethane provided 3a in quantitative yield with 71% enan- 
tiomeric excess (e.e.; Table 1, entry 6). Variation of the copper counterion 
under these conditions showed that Cul gave the highest e.e. (89% e.e., 
entry 10) and chloroform gave the best results of the solvents exam- 
ined (93% e.e., entry 11). Varying the temperature (entries 12 and 13) 
showed that slightly higher enantioselectivity was obtained by cooling 
the reaction to 0 °C (93% versus 95% e.e.) but we chose to perform the 
rest of our studies at room temperature because of operational conve- 
nience and under these conditions full conversion was always observed 
overnight. Examination of bromide 1a under these conditions (entry 14) 
showed that the less reactive chloride 1c gave superior results. 

The scope of the alkyl coupling partners was investigated using a variety 
of simple gaseous alkenes (giving products 3b-d) and unfunctiona- 
lized primary alkenes (3e-g) (Fig. 2). High yields and consistently high 
enantioselectivities were obtained. Functionalized alkenes (3h-3r) were 
also examined: halogens, alkynes, aromatic rings, ethers, protected alco- 
hols, trifluoromethyl groups and preformed stereogenic centres were all 
found to be compatible with the procedure. Electron rich alkenes such 
as allylsilane (to give 3i) and styrenes (to give 3m-o) also work well. We 
also examined the use of 5-membered rings with this procedure; we found 
that it is suitable for the direct preparation of highly enantioenriched 
cyclopentenes (4a-c) from an easily prepared racemic 5-membered 
allylchloride. 

To probe practical aspects of this chemistry, scale-up reactions were 
performed (Fig. 3a). Despite excellent conversion (>90%), the yield of 4b 
ona 15 mmol scale was low because the product and the residual alkene 
have similar polarities and boiling points, complicating isolation. Even 
so, we were easily able to obtain more than 1.8 g of pure 4b (58% yield, 
94% e.e.). When 1-hexene was used (to give 4a), we arbitrarily reduced 
the ratio of alkene and Schwartz reagent used (to 2 and 1.8 equiv. respec- 
tively) and observed improved yield (86% yield, 91% e.e., 1.2 g) relative 
to using our standard test conditions (Fig. 2). Functionalized 3n con- 
taining a bromide (Fig. 3b) was also synthesized on a preparative scale 
(88% yield, 87% e.e., 0.75 g) and a subsequent Heck reaction to provide 
cis-fused tricycle 5 (88% yield) demonstrates that these reactions can be 
used to rapidly access complex molecular frameworks. 

To demonstrate that this method allows rapid access to important 
structural motifs, we performed asymmetric syntheses of biologically 
active cyclopentene-containing natural products (Fig. 3c). This approach 
provides short, straightforward, flexible syntheses which compare fav- 
ourably to those previously reported”’, and will facilitate the study of 
structural analogues. Hydnocarpic acid (6) and chaulmoogric acid (7) 
are leprosy treatments used in ancient and traditional medicine: in both 
cases, the cyclopentenyl ring is a requirement for biological activity. 
Hydnocarpic acid is believed to act by blocking the activity of biotin or 
inhibiting microbial biotin synthesis”, while the activity of chaul- 
moogric acid is probably due to incorporation into the cell wall lipids 
of Mycobacterium leprae’. Recent reports show that chaulmoogric acid 
and anthelminthicin C (8) significantly inhibit Mycobacterium tuber- 
culosis growth with minimum inhibitory concentration values of 9.82 
and 4.38 |.M, respectively”*. Anthelminthicin C has also recently been 
found to inhibit para-aminobenzoic acid biosynthesis (pABAB)*; inhib- 
itors of pABAB are important leads for the development of new anti- 
biotics, as this pathway is not found in humans”. 

On the basis of previously reported reactions it seemed likely that 
DYKAT was occurring by one of the four following pathways: (1) the 
reaction proceeds through a pseudo-meso- or prochiral 1-allyl-Cu 
intermediate” similar to the Pd- or Ir-intermediates observed with 
stabilized nucleophiles®; (2) a related scenario, in which the reaction 
proceeds through rapidly interconverting intermediates derived from 
1d (such as o-allyl-Cu species), one enantiomer of which undergoes a 
selective reaction’; (3) the two enantiomers of 1d undergo two different 
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Figure 2 | Dynamic kinetic AAA with alkylzirconium nucleophiles 
generated from alkenes. Top row, alkenes react with the Schwartz reagent 
(Cp2ZrHCl) in CH,Cl, to generate alkylzirconocene species in situ which 
undergo AAA with racemic cyclic allylic chlorides. Cp, cyclopentadieny]. 
Conditions: 2.5 equiv. alkene, 2 equiv. Cp2ZrHCl, 10 mol% Cul, 10 mol% 
ligand C, room temperature overnight, argon atmosphere. Lower rows, yield 


reactions, both giving the same product (an enantio-convergent trans- 
formation)”; or (4) 1d racemizes during the reaction’ and AAA is selec- 
tive for one of the two enantiomers. 

To gain insight into the mechanism, we followed reactions in situ 
using 'H NMR spectroscopy. During these experiments, we observed 
clean conversion of 1d to either 3a or 3f, and the formation of low, but 
fairly constant, concentrations of allyliodide 10 (Fig. 4a, also see Sup- 
plementary Information). The initial drop in the concentration of 1d 
is an artefact of inconsistent NMR field inhomogeneities (‘shimming’) 
at early stages of the reaction. We were unable to observe the formation 
of alkyl-copper species (that would be produced by transmetallation 
from zirconium to copper) or o- or m-allyl-copper species under these 
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and enantiomeric excess of illustrated products. Yield refers to pure isolated 
compounds; the enantiomeric excess was either determined directly by 
high-performance liquid chromatography, or on the corresponding epoxides 
by gas chromatography, using a chiral non-racemic stationary phase. 
*Prepared using (S,S,S)-C (the enantiomer of the ligand shown in Table 1). 


conditions, and we are unable to rule out the possibility of such species 
playing a role in the enantioselectivity of these reactions. 
Examination of allyliodide 10 generated from 1d (by Cul and C in 
CDCI) using 2-dimensional nuclear Overhauser effect NMR spectro- 
scopy (NOESY) shows that the H, and H, protons of 10 rapidly inter- 
convert (as indicated by cross-peaks between these proton signals, Fig. 4b, 
top right). At ambient temperature 1d did not undergo observable 
dynamic processes, but at 55 °C (see Supplementary Information) we 
were able to detect interconversion between 1d and 10, demonstrating 
that the allyliodide and allylchloride interconvert, but not fast enough 
to be detected on the NMR timescale at room temperature. Under these 
conditions 1d was not observed to isomerize directly. Monitoring an 
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Figure 3 | Scale-up and applications of the method to access tricyclic 
structures and natural products. a, Experiments on a preparative scale to give 
>1g of cyclopentene products. b, Preparative-scale reaction on a 6-membered 
ring followed by Heck reaction to give cis-fused tricycle 5. c, Concise 
asymmetric synthesis of hydnocarpic acid (6), chaulmoogric acid (7) and 
anthelminthicin C (8). The syntheses start from commercially available 
11-bromo-1-undecene to give bromo-substituted 4d which is converted to 


Chaulmoogric acid (7) 


6 (by hydrolysis, then oxidation), or to 9 (by epoxide opening chain extension) 
followed by oxidation to 7. Coupling 7 with racemic solketal, then acid 
catalysed deprotection, gave anthelminthicin C (8). *Made using (S,S,S)-C (the 
enantiomer of the ligand shown in Table 1). DMP, Dess-Martin periodinane; 
DCC, dicyclohexylcarbodiimide; dppp, 1,3-bis(diphenylphosphino)propane; 
DMAP, 4-dimethylaminopyridine. Amberlyst 15 is a polymeric acid resin. 
See Supplementary Information for full details. 
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Figure 4 | Mechanistic analysis. a, Reaction kinetics as monitored by in situ 
“H NMR spectroscopy. Formation of an allyliodide intermediate (10) is 
observed. Plot at right shows concentration versus time. b, Examination of 
intermediates 10 (generated from Cul and C in CDC];) by NOESY: when 
generated from 1d (top, proton chemical shifts on both axes, shift ranges as 
follows: vertical, 4.5-6.3 p.p.m.; horizontal, 6.2-5.05 p.p.m.), allyliodide 
protons labelled H, and H, exchange. When generated from 1a (bottom, 
proton chemical shifts on both axes, shift ranges as follows: vertical, 4.1- 

6.3 p.p.m.; horizontal, 6.2-4.6 p.p.m.), rapid H,/H, isomerization is observed 
within both 1a and 10. Rapid exchange between 10 and 1a is also observed. 


AAA reaction to form 3a from 1d by in situ NOESY (see Supplementary 
Information) demonstrates that allyliodide 10 undergoes dynamic iso- 
merization during the AAA itself. 

When allylbromide 1a is dissolved in CDC], at room temperature, 
no dynamic processes are observed by NOESY (see Supplementary Infor- 
mation), but in the presence of Cul and Ca variety of fast exchanges are 
observed (Fig. 4b, bottom right). Allyliodide 10 is formed, and both la 
and 10 rapidly isomerize (as indicated by cross peaks between H,(Br)/ 
H.(Br) of laand H,(1)/H(D of 10). Rapid interconversion of laand 10 
occurs as indicated by cross peaks between H,(Br)/H,(I) and H,(I)/ 
H,(Br). As cross peaks between H,(Br)/H,(I) and H.(Br)/H_(1) are not 
observed, interconversion appears to exclusively occur by S$y2’ mech- 
anisms, although S\2 exchange may occur through minor pathways 
not observable on the NMR timescale. 
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Cross peaks between H,(Br)/H.(I) and H,(1)/H.(Br), but not between 
H,(Br)/H,(D) or H.(Br)/H,(1), suggest interconversion occurs by an Sy2’ 
pathway. c, Proposed mechanism for the DYKAT: as 1d racemizes via 10, one 
enantiomer of 1d undergoes selective AAA. d, Possible racemization 
mechanism: metal-assisted syn-Sy2' attack. e, 'H chemical shifts (vertical, 
relative intensity, 0.00-0.25; horizontal, chemical shift, 4.65-4.00 p.p.m.) of the 
benzylic protons of C are consistent with a change in metal-ligand aggregation, 
as a function of time (colours are for different spectra, taken at different times), 
from the initial [L,,Cul],, species. 


Taken together, these studies point to a mechanism (Fig. 4c) where 
1d racemizes, via 10, and selective AAA of one of the enantiomers of 
1d gives a product with high enantiomeric excess. When using copper 
halides, higher enantioselectivities are obtained when the halide is a better 
nucleophile and leaving group (that is, I > Br > Cl, Table 1, entries 8-10), 
facilitating interconversion, but the overall reaction rates with CuCl 
and Cul are similar, suggesting that it is 1d that undergoes AAA. 

Racemization probably occurs through syn-Sy2’ displacement reac- 
tions: Sy2’ pathways are observed by NOESY, and anti-Sy2’ reactions 
would not change the absolute sense of stereochemistry of the allylic 
halides. Many allylic substitutions are syn-Sy2’ selective, and ‘ion-pair’ 
nucleophiles undergo selective syn-Sy2’ reactions through cyclic transi- 
tion structures”. Our studies are consistent with the copper-mediated 
syn- Sn2" pathway shown in Fig. 4d. 
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We followed the enantiomeric excess of 3f formed as a function of 
time using CuCl, Cul and CuOTf. When using CuOTf, the reaction goes 
to completion much faster (<30 min) than with copper halides (over- 
night) and we did not detect any change in e.e. with time, so we tenta- 
tively speculate that CuOTf and Cu-halide catalysed reactions (Table 1, 
entry 6) are mechanistically distinct. Reactions using CuCl show a decrease 
of e.e. with increasing time (~82% e.e., 10 min; 68% e.e., 2 h; 54% e.e. at 
completion) and we speculate that when CuCl is used, the rapidly react- 
ing enantiomer of 1d is consumed, and not replenished by fast race- 
mization. The reaction with the minor enantiomer therefore becomes 
significant and reduces the e.e. of 3f as time progresses. We observed 
non-racemic 1d (at least 11% e.e. at 2h) during the reaction as judged 
by gas chromatography analysis (see Supplementary Information). In 
the case of Cul the initial reaction rate is slow, and only very small 
amounts of 3f are formed after 10 and 30 min. Here the e.e. increases 
with time (~85% e.e., 10 min; 90% e.e., 2 h; 95% e.e. at completion) sug- 
gesting that a more enantioselective system is generated as the reaction 
progresses. 

At least part of the role of Cul is to racemize 1d via 10, but the iden- 
tity of the proposed highly enantioselective catalyst is unclear. Both 
CuCl and Cul give lower initial values of e.e. than are observed at later 
stages of the Cul-catalysed reaction. It may be that copper aggregates 
containing both “Cl and T are more selective than those containing 
only one halogen. An experiment performed using 10% C + 5% CuCl 
+ 5% Cul ambiguously gave 3f with 90% e.e. (rather than 95% e.e.) at 
completion. Alternatively, salt effects may increase the e.e. of 3f as time 
progresses by favouring the formation of more highly enantioselective 
copper-ligand complexes; the reaction produces Cp,ZrCl, which may 
affect the relative solubility of other reaction components. 

"H NMR spectroscopy studies suggest that copper-ligand aggregates 
change during the reaction: we observe a shift in the benzylic proton 
signal of ligand C, from being characteristic of [L,,Cul],, complexes, 
towards new species as time progresses (Fig. 4e, see also Supplementary 
Information). Cu-halide phosphoramidite aggregates exhibit complex 
dynamic equilibria between several solution- and solid-phase species, 
which are sensitive to solvent as well as halide and salt effects*°, and so 
determining the actual composition of the copper species generated in 
the reaction will require further work. How the Cu-ligand complex selects 
for one enantiomer of 1d, and how the Cu-ligand complex interacts 
with the alkylzirconium species, are also both currently unclear. 

We have described a copper-catalysed enantioselective addition of 
alkyl zirconium reagents to racemic cyclic allylic chlorides. The reaction 
uses readily available starting materials and catalysts, tolerates a variety 
of functional groups and operates under convenient conditions. The 
reactions can be performed on gram scales, and we have applied the 
reaction to the asymmetric synthesis of biologically active cyclopen- 
tene natural products. Mechanistic studies suggest that this dynamic 
kinetic asymmetric transformation operates via a rapidly interconvert- 
ing intermediate, racemizing the substrate, and the formation of a more 
highly selective copper-catalyst in situ. This reaction is expected to com- 
plement the well-established palladium and iridium-catalysed methods 
with stabilized nucleophiles. Additionally, we anticipate that the mech- 
anistic insight will inspire future studies in the field. 
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The terrestrial uranium isotope cycle 


Morten B. Andersen”, Tim Elliott', Heye Freymuth', Kenneth W. W. Sims*, Yaoling Niu* & Katherine A. Kelley* 


Changing conditions on the Earth’s surface can have a remarkable 
influence on the composition of its overwhelmingly more massive 
interior. The global distribution of uranium is a notable example. 
In early Earth history, the continental crust was enriched in uranium. 
Yet after the initial rise in atmospheric oxygen, about 2.4 billion years 
ago, the aqueous mobility of oxidized uranium resulted in its signi- 
ficant transport to the oceans and, ultimately, by means of subduc- 
tion, back to the mantle’ *. Here we explore the isotopic characteristics 
of this global uranium cycle. We show that the subducted flux of 
uranium is isotopically distinct, with high *°U/?°U ratios, as a result 
of alteration processes at the bottom of an oxic ocean. We also find 
that mid-ocean-ridge basalts (MORBs) have **°U/?*°U ratios higher 
than does the bulk Earth, confirming the widespread pollution of 
the upper mantle with this recycled uranium. Although many ocean 
island basalts (OIBs) are argued to contain a recycled component’, 
their uranium isotopic compositions do not differ from those of the 
bulk Earth. Because subducted uranium was probably isotopically 
unfractionated before full oceanic oxidation, about 600 million years 
ago, this observation reflects the greater antiquity of OIB sources. 
Elemental and isotope systematics of uranium in OIBs are strikingly 
consistent with previous OIB lead model ages”, indicating that these 
mantle reservoirs formed between 2.4 and 1.8 billion years ago. In 
contrast, the uranium isotopic composition of MORB requires the 
convective stirring of recycled uranium throughout the upper mantle 
within the past 600 million years. 

Recycling of U from the surface to the Earth’s deep interior can be 
monitored by a decrease in the Th/U ratio of the mantle” *. Thorium pro- 
vides a valuable reference for several reasons. First, U and Th behave 
similarly as tetravalent species in the mantle, such that they are difficult 
to fractionate significantly by melting processes. Only under more oxi- 
dized surface conditions do the elements show contrasting behaviour, 
with Th remaining tetravalent and immobile during weathering, unlike 
highly water-soluble hexavalent U species. Second, both Th and U are 
refractory, lithophile elements, and so the Th/U of the silicate Earth can 
be estimated from measurements of meteorites. This planetary Th/U 
reference has recently been refined" to a value of 3.876. The Th/U of the 
terrestrial upper mantle, as inferred from analyses of MORBs, is notably 
lower than this value; two global studies of MORB yield a mean Th/U 
of ~3.1 (refs 12, 13). The low Th/U of the upper mantle is explained by 
addition of significant recycled U from the surface and can be recon- 
ciled with a surprisingly high time-integrated Th/U (ref. 14), as gauged 
from *°*Pb/**’Pb ratios, if the U recycling commenced in the latter half 
of Earth history’ *. This makes good geological sense, as before the 
Great Oxidation Event (GOE) ~2.4 Gyr ago (see, for example, ref. 15) a 
reduced atmosphere inhibited the surface mobility of U and prevented 
U recycling. 

Here we test and extend this model of global U cycling using isotopic 
measurements of U to complement the inferences from elemental Th/U. 
Recent work'*’” has shown that surface processes induce U isotopic 
variations (~1%o) that are significantly greater than the typical ana- 
lytical precision (~0.05%o). Natural variations in ?**U/?*°U are chiefly 
linked to the reduction of U(v1) to U(1v), and the magnitudes of such 


fractionations are inversely proportional to temperature’*”’. So whereas 
U isotopic ratios can be perturbed at the surface, the high temperatures 
and dominance of tetravalent U in the mantle inhibit significant iso- 
topic fractionations at depth. Any ‘exotic’ **°U/?*°U signatures, produced 
by low-temperature fractionation and transported into the mantle, 
should therefore provide a robust tracer of surface-processed U. 

To explore this possibility, we have characterized, to high precision, 
the 8°°°U (the parts per thousand difference in 7**U/?*°U relative to a 
reference solution standard, CRM 145) ofa range of samples including 
meteorites, mantle-derived basalts, and the inputs and outputs of an 
archetypal subduction zone. A summary of our results is plotted in Fig. 1 
and reported in Table 1. Measurement precision varied with sample size, 
but typically the magnitude of the error in 8**°U was less than +0.03%o 
(2s.e.). For most samples, the resolution of natural variability was 
limited by our long-term reproducibility. This was gauged from repeat 
measurements of the geological standard BHVO-2, which yielded 
5°8U = —0.314%o £ 0.028% (2 s.d. on 21 replicates). We additionally 
obtained 7**U/??°U measurements, which provide a valuable monitor of 
recent U disturbance. All key samples gave values within error (+3%bo) 
of secular equilibrium for (7°*U/?**U), the ?**U/?*8U activity ratio. Fur- 
ther details of our analytical procedures and values for a wider range of 
standards measured are provided in Methods. 

Primitive, chondritic meteorites are typically used as a reference for 
bulk planetary compositions, and so we analysed several ordinary chon- 
drites to try to define a reference ‘bulk Earth’ 5°°*U. The very low U 
contents of chondrites make high-precision measurements especially 
challenging, so we further analysed two eucrites. These higher-[U] sam- 
ples should still provide a useful planetary datum given the incompatible, 
lithophile and refractory nature of U. Our meteorite analyses have 5**°U 
values that range from —0.44%bo to —0.30%o, overlapping with two exist- 
ing, lower precision determinations of chondritic eU (—0.42 + 0.09%o 
(ref. 20) and —0.37 + 0.09%bo (ref. 21)). The variability in 8°°8U of our 
meteorite analyses, however, is greater than the measurement pre- 
cision and probably reflects recent disturbance of some of our samples 
(Methods). Thus, we propose a planetary estimate based on the weighted 
average of the two unaltered samples, with (***U/?*8U) within error of 
unity. This yields 8°°°U = —0.306 + 0.026%. 

We wish to compare the 5”*°U of the terrestrial mantle with this new 
meteoritic datum, inferred to represent bulk Earth. To this end, we have 
analysed a wide range of basalts (Methods), which effectively sample the 
mantle for an incompatible element such as U. Whereas the shallow 
convecting mantle is probed by MORBs, OIBs are widely assumed to 
be generated from hot, upwelling plumes (possibly containing com- 
ponents from recycled plates) that provide a window into the deeper 
mantle. We have analysed fresh MORB glass from all three major oceanic 
basins and OIBs that cover a large portion of mantle heterogeneity as 
gauged from radiogenic isotopic compositions. It is clear from Fig. 1 
and Table 1 that MORBs and OIBs have different mean 57*°U. OIBs 
with a wide range of Th/U have 8°8U within the error of the bulk Earth, 
whereas MORBs have significantly higher 6”°°U at lower Th/U. The 
strikingly superchondritic 5°*°U we observe in MORBs would strongly 
support the scenario of widespread pollution of the upper mantle with 
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Figure 1 | Uranium isotopic compositions (5”**U) versus Th/U ratios for 
mantle-derived basalts and altered oceanic crust. a, OIBs (circles) have 8°°U 
similar to the bulk Earth (crossed square), whereas the higher 835U and 
lower Th/U of MORBs (diamonds) imply a mixture (shown as a grey line) 
between bulk Earth and average modern altered oceanic crust (AOC; square, 
showing the ODP Site 801 supercomposite). Mariana arc basalts (triangles) 
show positive covariation of Th/U and 5”**U, reflecting the mixing of two 


surface U, if recycled U were isotopically heavy. Thus, we examine the 
U isotopic composition of the subducting plate. 

Although variations in 8°°*U in the sedimentary environment are 
now established, appropriate measurements to characterize subducted 
materials are unavailable. We principally focus on determining the iso- 
topic composition of U added by submarine alteration to the igneous 
oceanic crust, which is the key flux in accounting for the low Th/U of 
MORBs**. Comprehensive studies of cores obtained from deep drilling 
through oceanic crust**””? (see also the review in ref. 24) demonstrate 
that heterogeneous addition of U has occurred throughout the upper 
~500 m of the mafic AOC. Subduction of this ‘excess’ U is sufficient to 
lower the Th/U of the upper mantle to 2.5 in ~2 Gyr (ref. 6). 

We have analysed ‘composite’ samples from ODP Site 801, which pen- 
etrates 420 m into ~170 Myr-old Pacific mafic crust”. The composites 
are mixtures of the different lithologies and alteration styles present, 
blended as powders in representative proportions. This is an efficient 
means of obtaining a reliable average composition of the heterogeneously 
altered mafic crust?*”>. The 5°*8U of the composites varies from —0.45%o 
in the uppermost part (0-110 m) to higher values (—0.15%o to +0.16%0) 
in the deeper parts (110-420 m) (Table 1). This variability probably 
reflects a change from oxic incorporation of U near the surface’® to 
(partial) reductive U roll-front-type sequestration”, in keeping with a 
generally observed change in alteration style*”*. The ‘supercomposite’ 
from ODP Site 801, representing a weighted average of the full 420 m 
upper crustal section, has 8°°°U = —0.17%o0 and [U] =0.39pg¢g ', 


Table 1 | Summary of 573°U for sample groups 


Sample type 8238U+2s.e Th/U N 
Unaltered meteorites (bulk Earth) —0.306 + 0.026 3.84 2 

OIBs —0.308 + 0.005 3.48 19 
MORBs —0.268 + 0.011 2.44 11 
Mafic AOC 

Average —0.170 + 0.026 0.44 1 

0-110m —0.436 + 0.042 0.30 3 

110-220m +0.164 + 0.086 0.45 3 

220-420 m —0.145 + 0.045 0.44 3 

Mariana arc lavas 

Guguan (fluid rich) —0.405 + 0.023 173: 2 

Uracas (sediment rich) —0.333 + 0.016 2.84 2 

Average Mariana sediments —0.354 + 0.039 = 7 

Seawater (open ocean) —0.390 + 0.006 = 3 


A full table with individual data is presented in Supplementary Table 1. 
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components: a sedimentary source, with high Th/U and 87°8U; and a source 
with low Th/U and low 5?°*8U similar to the upper section of the AOC (open 
triangle, showing the 0-110 m ODP Site 801 composite). The brown arrow 
shows the best fit of the Mariana arc data pointing towards the inferred fluid 
component from the AOC. b, Histogram highlighting the distinctively heavy U 
isotope composition of MORBs relative to the bulk Earth and most OIBs. 


which is five times higher than the unaltered basalts®. This superchon- 
dritic value for the average 5°**U reflects the dominance of reductive 
addition of U(Iv) to the upper oceanic crust as a whole (Methods). 
Interestingly, MORB compositions lie close to a simplistic mixing line 
between this average altered crustal composition and the bulk Earth 
reference (Fig. 1). 
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Figure 2 | Cartoon of the terrestrial U isotope cycle over the history of Earth. 
a, Prior to the GOE ~2.4 Gyr ago, low atmospheric oxygen levels limited U 
mobility on the surface. b, The GOE would have heralded an enhanced U 
weathering flux to the oceans. Quantitative release of U during weathering 
would yield a U flux to the oceans with &U =~ —0.3%o, and quantitative 
extraction of U into the dominant reducing sinks from the ocean, including 
AOG, would also give 5°°*U ~ —0.3%o for any recycled U delivered to the 
mantle. c, During the past 600 Myr U has been isotopically fractionated during 
partial, reductive uptake of U into the AOC from the now largely oxic 
oceans. This fractionated U in the subducted AOC U fluxis released at different 
depths; initially the isotopically light U in the uppermost crust is lost to arc 
magmatism, and the heavy U from the deeper crust is released beyond the arc 
front into the convecting upper mantle. The residual, deep-subducted crust has 
a U isotopic composition similar to unaltered MORB. 
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Figure 3 | Pb model ages versus Th/U in OIB mantle sources. Two-stage 
Pb model ages (?°’Pb-*°Pb ages") versus time-integrated Th/U, using 
measured ?"°*Pb—*°°Pb and model source ages (a) and measured Th/U for 
the same suite of OIB samples (b): Hawaii (1), Iceland (2), Azores I (3), La 
Palma (4), French Polynesia (5), Samoa (6), Azores II (7) and Réunion (8) 


Finally, we assess the consequences of subduction for the U isotope 
composition of recycled crust, using the Mariana arc as a well-understood 
example. During subduction, material is lost from the downgoing plate 
and is incorporated into magmas erupted at island arcs. The Mariana 
arc lavas show evidence for two slab-derived components; a melt from 
the sedimentary section and a ‘fluid’ from the mafic oceanic crust”*. A 
regression line through the array of Marianas lavas in Fig. 1 should point 
towards the sediment component at high Th/U and the fluid compon- 
ent at low Th/U. The average 5°°°U of sediments subducting beneath 
the Mariana (—0.35%o0 + 0.04%o; Table 1) is compatible with the com- 
positions of the lavas with high Th/U, and an extension of the array of 
Mariana arc lavas to low Th/U indicates that the fluid derived from the 
mafic oceanic crust has a low 8***U. Such an isotopically light com- 
position can be explained if either the U is preferentially lost from the 
uppermost altered mafic crust (Fig. 1), or U isotopes are fractionated 
during (partial) loss from the plate. In either case, the U that is lost to 
the arc is isotopically lighter than the bulk input to the subduction zone, 
and so U transported beyond the arc must become even heavier than its 
initial composition. 

Thus, we are confident that modern surface cycling of U results in a 
substantial flux of isotopically heavy U into the mantle. This observation 
provides a ready explanation for the superchondritic 5”°*°U of MORBs. 
The isotopically heavy U must be carried in the altered, mafic crust beyond 
the zone of arc magmatism, but subsequently lost to the upper mantle 
(Fig. 2). Transport of U in an accessory mineral such as allanite” is a 
possible means of effecting this outcome. 

Like MORBs, many OIBs have significantly subchondritic Th/U, 
indicative of the addition of recycled U to their sources. Yet all OIBs 
have 87°°U within the error of the bulk Earth value, which is inconsis- 
tent with the modern U cycle. We noted above, however, that the high 
5°*°U characteristic of average recent oceanic crust is the result of par- 
tial reduction of U-rich, oxidized seawater as it percolates through the 
submarine volcanic edifice. This scenario has been possible only in the 
last ~600 Myr, since the second rise in oxygen in the late Proterozoic 
eon and the establishment of fully oxic oceans (see, for example, ref. 15). 
Between 600 Myr ago and the initial rise of oxygen ~2.4 Gyr ago, U was 
mobile during surface weathering but was rapidly scavenged from the 
reduced oceans’*. Quantitative removal of riverine U supplied to the 
oceans would result in a flux of U to the sea floor that was isotopically 
unfractionated (Fig. 2 and Methods). The implications of this concep- 
tual model are that OIB sources formed from recycled oceanic crust 
between 2.4 and 0.6 Gyr ago should have increasingly subchondritic 
Th/U, but chondritic 5*°U. Furthermore, any OIB source formed ear- 
lier than 2.4 Gyr ago should have both chondritic Th/U and chondritic 
ou. 

We have further investigated this idea using Pb model ages of several 
OIB suites. Their Pb model ages are calculated using a two-stage model, 
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Measured Th/U 


(Methods). Despite potential recent disturbance of the measured Th/U in OIB 
samples, both plots document a similar relationship of decreasing Th/U with 
decreasing Pb model ages. These increasingly subchondritic Th/U ratios are 
consistent with progressive U addition to the mantle from subduction since the 
GOE ~2.4 Gyr ago. 


similar to that of ref. 10 (Methods). As in ref. 10, we find that OIBs have 
a range of source model ages. Notably, these ages correlate with the 
Th/U of the samples (Fig. 3), in the manner discussed above. We show 
two plots in Fig. 3: one with time-integrated Th/U as determined from 
Pb isotopes (Fig. 3a), and the other with measured Th/U (Fig. 3b). Not 
unsurprisingly, the data using time-integrated Th/U form a tighter array, 
because any recent Th/U fractionations from source composition dur- 
ing melting and melt migration to the surface are removed (Methods). 
However, both plots independently document a similar relationship, 
with Th/U becoming increasingly subchondritic in OIB sources younger 
than ~2.4 Gyr. 

The remarkable implication of the ideas presented above is that the 
two-stage rise in atmospheric oxygen, reconstructed from observations 
on the Earth’s surface, is reflected in the Th-U-Pb systematics of mantle- 
derived basalts. The coherence of our observations with those antici- 
pated from a first-order model of U recycling also lends credence to the 
significance of Pb model ages of OIB sources”. The range of OIB model 
ages therefore places valuable constraints on the maximum and min- 
imum incubation times of an OIB reservoir, which may be the longest 
and shortest residence times of subducted slabs at the base of the mantle 
before they become sufficiently buoyant to return to the surface. Fur- 
thermore, our inference that isotopically heavy U has been introduced 
into the mantle over only the last 600 Myr yields a maximum timescale 
for its effective stirring into the MORB source. This value is reassuringly 
consistent with an estimate based on a markedly different approach 
using Pb isotopes”. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Sample preparation. With two main exceptions, existing powders of samples were 
used for this study. Preparation techniques for these powders are documented in 
the associated studies. Powdered samples of bulk meteorites were prepared at the 
University of Bristol for this study, from chips carefully picked under the micro- 
scope to be free from alteration, fusion crust or signs of saw-blade marks. Likewise, 
all MORB samples were prepared afresh by picking millimetre-sized fragments of 
glass without signs of alteration, surface coatings or devitrification. 

Given the low U concentrations of MORB, the consequences of possible secondary 
U addition by absorption to marine Fe-Mn oxyhydroxide coatings is a concern. 
Although careful hand-picking should address this problem, we further processed 
the picked glass, using a mildly reductive procedure based on methods for dissolv- 
ing Fe-Mn coatings in marine sediments, using a mixture of 0.05 N hydroxylamine 
hydrochloride, 15% acetic acid and 0.03 N Na-EDTA at pH ~4 (ref. 31). Similar 
approaches have been used in preparing MORB glasses for U-series analysis*>-**. 
Each sample was leached three consecutive times by adding 12 ml of the leaching 
solution to the bulk samples in pre-cleaned centrifuge tubes, and placing these in a 
vortex shaker for 24h at room temperature (~20 °C). Samples were thoroughly 
rinsed in 18 MQ cm water following each leaching step. 

The elemental concentrations of each leaching solution were determined using the 
Element 2 ICP-MS of the Bristol Isotope Group, using already established methods”. 
Potential contaminant U removed in each leach fraction was monitored using the 
total U concentration and the ratios of U to immobile elements Th, Sc, Ti and Zr 
(Supplementary Table 2). For most samples, the first leach released U in higher 
proportions than the immobile elements; however, at the end the three leaching 
steps the ratios of U to the refractory elements approached the ratios measured in 
the total bulk (residual) MORB sample after full dissolution. This suggests that any 
minor, absorbed U was effectively leached away. The U removed during reductive 
leaching represented only 1% to 4% of the total U that remained in the bulk residue. 
All MORB samples have (7**U/?**U) within a few per mil of secular equilibrium, 
except D18-1, which hada ?4u?8U) ~14%o above secular equilibrium, suggest- 
ing that some U contaminant was still present after the reductive leaching. The ele- 
vated (7*4U/?*8U) of seawater (1.146) suggests that this contamination is marine 
derived. Despite the minor residual U contamination, we have chosen to leave it in 
our data set, but note its value is not as robust as the other measurements (see below 
for further discussion). 

In addition to the natural samples, a basalt powder was made into a glass and 
leached to assess whether the reductive leaching caused any U isotope fractionation 
in the residual glass. Given the small total amounts of U removed during leaching 
and that the final leaching solutions appeared to have removed material with bulk 
glass composition (Supplementary Table 2), it seemed unlikely that leaching would 
bias the U isotopic composition of the residue; but for completeness we tested this. 
A basalt powder was melted in a platinum crucible and subsequently quenched by 
dropping the platinum crucible into a bath of 18 MQ cm water at room tempera- 
ture. Given the low ages of the MORB glasses in this study, they have experienced 
little U-series o-recoil damage and can therefore be directly compared with the leach- 
ing of the artificial quenched glass. Measurements of the quenched glass gave iden- 
tical 8°**U before and after the reductive leaching step (—0.23%o + 0.03%o versus 
—0.24%o + 0.03%o) and identical (7°*U/?*8U) (0.995 + 0.003 and 0.999 + 0.002), 
showing that the leaching process does not fractionate the U isotopes of fresh glass. 
Sample dissolution, spiking and column chemistry. Sample sizes up to ~1.5g 
were dissolved in a single beaker. For larger samples (for example chondrites and 
MORB) several splits of the same sample were dissolved separately and then all 
aliquots were combined after full dissolution. Terrestrial silicates were dissolved in 
a mixture of concentrated HF/HNO3/HCIO, acid, fluxed on a hotplate for 24h 
and dried by stepwise-increasing the temperature from 120 °C to 200 °C. Samples 
were then fluxed twice in ~6 N HCl (~15 ml per gram of sample) on a hotplate 
(120 °C) and dried down in between. Samples were redissolved in 7 N HNO; and 
then diluted down to 2 N HNO; for TRU Resin column chemistry. Both the eucrites 
(~1 g) and the ordinary chondrites (~4 g in three aliquots) were initially dissolved 
in a similar way to larger terrestrial silicate samples. However, after the 7 N HNO; 
step the dissolution was incomplete and a residue of dark residue remained. These 
residues were isolated from the remaining dissolved sample and refluxed in a 
mixture of 3 ml concentrated HNO; and 1 ml concentrated HCl ina high-pressure, 
Anton-Paar asher (200 °C and 100 bar pressure, in silica glass vials). This step fully 
dissolved the residues, which were then remixed with the already dissolved sample 
aliquots, dried down and redissolved in 7 N HNO; before TRU Resin chemistry 
in 2N HNO,. 

All samples were spiked with the IRMM-3636 **°U-”*°U double spike** (aiming 
for °°U/?°U of ~5), either before dissolution or before column chemistry for sam- 
ples combined from separately processed aliquots. 

The U was separated from all other matrices in a two-step procedure by TRU 
Resin and UTEVA chemistry. The TRU Resin chemistry was optimized for large 


sample sizes with ~2 ml resin loaded in polypropylene Bio-Rad columns. Up to 
~1g of sample was loaded on each column. Samples were loaded and matrix eluted 
using 40 ml 2 N HNO,, and U was collected in 8 ml 0.3 N HF/0.1 N HCl. Samples 
were then dried down and fluxed in concentrated HNO;/H,O, to eliminate any 
organic material from the resin bleeding into the sample. For large samples that 
were split over several columns, these were homogenized at the end of this phase 
and then dried down to be redissolved in 3 N HNO; for UTEVA chemistry. The 
UTEVA chemistry was performed in shrink-fit Teflon columns containing ~0.6 ml 
of resin. Loading and matrix elution steps used 20 ml of 3 N HNOs, before elution 
of Th in 3 ml of 3N HCL, and collection of the purified U in 8 ml 0.3 N HF/0.1N 
HCI. Samples were then dried down, fluxed in concentrated HNO3/H,03, dried 
down and redissolved in the requisite amount of 0.2 N HCl for the desired U con- 
centration (100-300 p.p.b.) for MC-ICPMS measurements. Full U recovery (>95%) 
was obtained using this method with total chemistry blanks of <20 pg for all sam- 
ples (negligible compared with sample sizes >20 ng). 

MC-ICPMS measurement set-up. The U isotope measurements were conducted 
ona Thermo Finnigan Neptune MC-ICP-MS (serial no. 1002) of the Bristol Isotope 
Group, running in low mass resolution (M/AM ~ 500) and using an Aridus desolvat- 
ing nebulizer introduction system. Uranium sample sizes of 40-150 ng were con- 
sumed during individual analysis. The data were collected in static mode ina similar 
fashion to that described in ref. 37. All cups were connected to feedback amplifiers 
with 101! Q resistors, except for the 2280), cup, which was connected to a feedback 
amplifier with a 10’° Q resistor to accommodate a larger ion beam. Uranium tailing 
and hydride formation were monitored as described in ref. 37. Both the 237.05/7°°U 
abundance sensitivity and the hydride and high-side tailing formation at 1 a.m.u. 
(measured as 239.05/*°"U) were (2-3) X 10 °, and remained stable during each 
measurement session. The low-mass side-tailing contributions to tel O Riciial O Read Of 
and?*°U were estimated, and corrected for, from interpolation ofa linear-log fit to 
mass versus tailing intensity, as used in ref. 38. Furthermore, corrections for 232ThH* 
and 1 a.m.u. high-mass tailing on ***U were made to measurement, assuming sim- 
ilar behaviour for Th and U. These were of minimal importance, however, owing to 
the good separation of U from Th during the UTEVA chemistry ?*’Th/?°U < 1). 

Measurements were conducted using typical ion beam intensities of ~1 nA for 
238U ~7 pA for 23517 ~40 pA for 236U and 77U, and ~0.05 pA for 25417, integrated 
over a period of 80 X 4s. Washout and on-peak blank measurements were similar 
to those described in ref. 37. On-peak U blank intensity never exceeded 10 p.p.m. of 
the total 7*°U beam of the sample and was generally <2 p.p.m. 

The Neptune was equipped with a large plasma interface pump (turbo-booster) 
offering enhanced transmission efficiency when combined with ‘jet+X cones’, as 
opposed to ‘standard+X cones’. The data obtained are identical for both set-ups 
(Supplementary Tables 3 and 4). General U transmission efficiencies were ~1.5% 
for standard+X cones and ~3% for jet+X cones. 

After tailing and hydride corrections, the measured 3317/?3°Ly ratio was used 

for mass bias correction, using the exponential mass fractionation law®’. To obtain 
23417238 and?°U/?8U ratios, the minute ~°U, 7*°U and?“‘U contributions from 
the IRMM-3636 spike were subtracted. Based on a calibration using CRM-145 (Sup- 
plementary Table 3), the U isotope ratios used for the Bristol IRMM-3636 were 
°UP?U = 0.98130, UP? *U = 4,259, °°U/?°U = 21,988 and “UU = 2,770. 
These ratios are identical to the certified ratios from IRMM-3636* for *°U/??U 
(0.98130 + 0.00015), °°U/?*8U (4,259 + 7) and?*°U/?*U (21988 + 36), but diverge 
slightly for °°U/?*4U (2732 + 4). Measurements of all unknown samples were 
bracketed individually and normalized to CRM-145 standard measurements, spiked 
with IRMM-3636, in a similar fashion to the unknowns. 
Measurement performance, reproducibility and accuracy. For each measure- 
ment sequence, the mean of the absolute ?38U7/?3°U of the CRM-145 standard used 
was typically within +50 p.p.m. of the value (137.832) reported for the NBL 112a 
standard*’, and the 7**U/7"U ratios were within +2%o of published ratios for the 
CRM-145 standard”. The (7**U/?*8U) activity ratios were calculated using the half- 
lives in ref. 40. All samples in this study were normalized to the CRM-145 standard, 
such that any deviations relative to the absolute ratio of the CRM-145 standard, 
potentially related to a non-exponential component of instrumental mass frac- 
tionation, are corrected. 

Given the use of mixed feedback amplifier resistors with different response times, 
internal error estimates may give a misleading impression of true precision (see, 
for example, ref. 41). Thus, to test the full external reproducibilities for individual 
unknown samples, a total of eight splits of BHVO-2 were individually processed 
(dissolution, spiking and chemistry) and measured during different analytical ses- 
sions (with different set-ups; see Supplementary Table 4). The external reproduc- 
ibility of 5°8U for BHVO-2 was +28 p.p.m. (2s.d.) anda similar or better external 
reproducibility was obtained for the basalt LP 45 E (a historic basanite from La Palma), 
the in-house CZ-1 uraninite and three open-ocean seawater samples (Extended Data 
Fig. 1). Other samples measured in duplicate agreed within their 2 s.e. estimates 
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(Supplementary Table 4). The reproducibilities of **4U/?*8U were limited by the 
low 7“‘U intensities (<0.2 pA), but were +3%o or better for the standards. 

Potential artefacts in the measured 8°*°U from different U/matrix ratios are 

unlikely given that ordinary chondrites (low U/matrix ratio) and OIB (high U/matrix 
ratio) both have lower 8°*°U compared with MORB (intermediate U/matrix ratio). 
Furthermore, consistent results are obtained when comparing the standards mea- 
sured in this study and other studies using comparable normalizing standards 
(NBL 112a’”, SRM-950a”! and CRM-145'°). The measured uraninite CZ-1 stan- 
dard (87°U = —0.053%o + 0.029%o) is within error of the value reported in ref. 16 
(—0.10%o + 0.07%). Similarly, the BHVO-2 measurements in this study (8°8U = 
—0.314%o0 + 0.028%) compare well with measurements in ref. 21 (—0.32%0 + 
0.07%o). Finally, our open-ocean seawater measurements (87*8U = —0.390%o + 
0.018%) agree well with open-ocean seawater measurements in ref. 17 (—0.41%o0 
+ 0,03%o). 
Uranium in extraterrestrial material and a bulk Earth 8°**U. Previous work has 
shown large variability in the 8*°5U of extraterrestrial material2°*!“?“*, Thus, even 
if it exists, estimating a uniform chondritic 8***U composition is challenging. 
Some of the observed 8°*8U heterogeneity has been attributed to variable addi- 
tion of 7°U from the decay of, now extinct, ?47Cm (ref. 45). However, meteorites 
also show relative depletions in **°U, indicating that other processes may have a 
role”! (for example nucleosynthetic anomalies or planetary formation processes). 
An additional concern is terrestrial perturbation of U, which may be indicated from 
the physical preservation and anomalous chemical composition. Specifically for U, 
Th/U departing from the recently defined meteoritic reference" and (7*4U/?*8U) 
activity ratios out of secular equilibrium may testify to planetary body processes” 
or terrestrial perturbation. 

Owing to the generally low U abundance in meteorites, it is necessary to obtain 
large chondrite samples (~5-10 g) to allow high-precision 8°**U measurements, 
which can run counter to museum loan policies. Thus, we initially honed our tech- 
nique on three large desert meteorite ‘finds’ (M2, M12 and M15), provided by M. 
Anand from the Open University. These had previously been studied petrogra- 
phically and characterized at the University of Bristol. We had prepared several 
hundred grams of powder from the interiors of these ordinary chondrites. As finds, 
however, these samples were potentially perturbed by terrestrial weathering. We 
subsequently obtained large (~25 g) samples of two ‘falls’ (Zag and Saratov) from 
the Meteorite Market (http://www.meteoritemarket.com/). All these samples are 
ordinary chondrites, which are not only more readily available than carbonaceous 
chondrites, but are isotopically more similar to the Earth (see, for example, refs 48, 49). 

We supplemented our ordinary chondrite measurements with analyses of two 
eucrites, Juvinas and Stannern, kindly provided by the Natural History Museum, 
London. Eucrites have higher U contents and so require smaller sample sizes (~1 g) 
for high-precision analyses. Although differentiated meteorites, the isotope ratio of 
a highly incompatible element such as U should be minimally affected during crust 
formation, and so we believe that these samples still provide a valuable planetary 
reference. Notably, eucrites generally have chondritic Th/U ratios” ”’, indicating 
an absence of elemental fractionation during their formation. The eucrite samples 
we analysed were falls, and so are probably less prone to terrestrial weathering than 
the finds. 

Of the ordinary chondrites, two gave identical, but relatively low, &°8U ratios 
(M15 (—0.439%o + 0.030%o) and Saratov (—0.442%o + 0.050%o)), whereas the three 
others were all within error but were ~100 p.p.m. higher (M2 (—0.322%o0 + 0.030%o), 
M12 (—0.326%0 + 0.022%o) and Zag (—0.301%0 + 0.050%o)). The eucrites also dif- 
fered in their 5°°8U (Juvinas (—0.312%o + 0.030%o) and Stannern (—0.369%o = 
0.030%o)), but with Juvinas overlapping with the compositions of the heaviest ordi- 
nary chondrites (Extended Data Fig. 2). 

The three ordinary chondrite desert finds have elevated (34U/?38U) ratios (1 to 
12% 7U excess) suggesting oxidative weathering during their time at the Earth’s 
surface, with oxidation of Fe potentially promoting mineral surfaces for U sorp- 
tion, with a positive correlation between U concentration and (4U/?8U) ratios 
(Extended Data Fig. 2). Furthermore, Saratov also had ~1% elevated C* uP u), 
whereas Stannern was ~1% depleted in (7*4U/?**U) relative to secular equilibrium. 
Only the ordinary chondrite Zag and eucrite Juvinas were at secular equilibrium 
for (74U/?*8U). The independent constraints provided by (?**U/?*8U) show that 
only Juvinas and Zag can be considered pristine and, notably, their 5**°U values are 
within error of each other (Extended Data Fig. 2). 

Perturbation of U in the meteorite samples is also indicated by their Th/U ratios 
relative to the planetary reference value’’ of 3.876 + 0.016. The ordinary chondrites 
with the lowest 5**°U have the highest [U] and lowest Th/U, further suggestive of 
U addition. Three samples (Juvinas, Zag and M2) have Th/U within error of the 
reference value, and all have 87°°U values within error of each other (Extended 
Data Fig. 2). However, given the minor (4uP8U) excess in M2, we do not include 
this in our best estimate of the bulk Earth value, 87°°U = —0.306%o + 0.026%o, pro- 
vided by the weighted average and weighted 2:.e. of Juvinas and Zag. Despite 
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demonstrable open-system behaviour of U, the mean of all meteorite samples 
gives a 5°°U of —0.36%o + 0.04%o (+2 s.e.m.), which is within error of the weighted 
estimate from pristine samples (Extended Data Fig. 2). Although our best estimate 
for the bulk Earth from our meteoritic samples is defined by only two samples, and 
would usefully be substantiated by additional measurements, we believe that the 
systematics of the altered samples provide important evidence to support the sig- 
nificance of this best estimate. Moreover, in terms of our main observations on 
terrestrial samples the choice is not critical; for either the mean of all the meteor- 
ites or just the pristine ones, the 5°°°U of MORB are heavier while the 5°*°U of 
OIBs are unresolved from these meteoritic values. 

5735U in OIBs. A suite of 19 OIBs from Iceland, Cape Verde, Azores, Canary Islands 
and Hawaii were measured. Further details on these samples are provided in Sup- 
plementary Table 1. We have dominantly used historic samples, collected previ- 
ously for U-series studies, which have the major advantage of being fresh. Notably, 
we analysed four non-historic, but still relatively young (~1 Myr) and ostensibly 
petrographically fresh samples from La Palma, Canary Islands. Two of these sam- 
ples (LPF 96-39 and C820) yielded (4U/?8U) out of equilibrium, warning us against 
using older samples from possibly more extreme weathering environments else- 
where. Nevertheless, these two samples, showing clear open-system U-series beha- 
viour with ?74U/?*8U) ~2% lower than secular equilibrium, are still within error of 
the other La Palma samples for 3°°°U, and so we did not exclude these data from 
our averages. This also indicates that 8°*°U is not hugely sensitive to minor pertur- 
bations of the U budget. 

In terms of traditional radiogenic isotope characterization, the islands we have 
studied cover high 3He/*He (Hawaii, Iceland), HIMU (La Palma, Canaries), EMII 
(Sao Miguel, Azores) and FOZO (Pico, Azores, and Fogo, Cape Verde) ‘flavours’ of 
mantle signature**>*. Although La Palma is not as radiogenic in its lead isotope 
ratios as the classic French Polynesian and St Helena localities, the latter have 
suffered ~10 Myr of tropical weathering, and so are far from ideal for characteriz- 
ing primary U isotope ratios. We have not measured any representative samples 
from EMI-type mantle, but nevertheless cover a large compositional range of OIB. 
5?>°U in MORBs. We have measured eleven glassy, axial or near-axial MORB sam- 
ples from all three major ocean basins: the Indian (n = 1), the Atlantic (n = 3) and 
the Pacific (n = 7). Further details on these samples are given in Supplementary 
Table 1 and associated references**°. All picked glasses were leached to remove 
possible absorbed U on ferro-manganese coating, as discussed earlier. The eleven 
MORB glasses have Th/U ratios of 2.1 to 2.6 and all have ?4uU/?3*U) within a few 
per mil of secular equilibrium, except the already discussed Atlantic Ocean sample 
D18-1. As for the OIBs with perturbed (7**U/?*8U), the 5”°°U does not appear 
strongly affected and D18-1 has 8?8U = —0.265%o + 0.030%o, identical to the 
mean 87°*U of all MORB; we thus did not exclude this data point from our aver- 
ages. This observation is compatible with a mass balance calculation to account for 
the observed (7**U/?*8U) disequilibrium of D18-1 assuming the contaminant has 
a U isotope composition similar to seawater. Adding seawater with (7*4U/??*U) of 
1.146 to MORB at secular equilibrium should result in a change in 5”*°U of less 
than 0.02%o given a seawater 8°°8U of —0.39%o. 
8*°U in island arc volcanics. A suite of nine mafic samples from the Mariana arc 
front** was selected to investigate subduction zone processes (Supplementary Table 1). 
These well-characterized samples show variable subducted sediment input to their 
sources, combined with a rather constant flux of ‘fluid’ from the subducting, mafic 
oceanic crust”®. In more detail, it has recently been argued that the sediment com- 
ponent evident in the arc lavas is dominated by the volcaniclastic horizons rather 
than representing an average of all lithologies, in which pelagic clay has a significant 
role”**', Samples with small sediment contributions, as marked by high '‘*Nd/'“Nd 
and low Th/Nb, have low Th/U and high ?8U/?°Th), implying a recent, slab- 
derived U addition to their mantle source. The systematic compositional variations 
of these lavas allow us to extrapolate to the possible 8”*°U of this slab-derived fluid, 
using a best-fit linear regression line through the data in the Th/U-8°*°U space, as 
shown in Fig. 1. 
3°**U of subduction zone inputs. Subducted crust can be separated into three 
principal, chemical components: unaltered oceanic crust, mafic AOC and sedi- 
ments. Here we analysed sediments and AOC from well-characterized deep-ocean 
drill holes*?>“?, ODP Site 801 and ODP Site 802 in the west Pacific, to assess the U 
budget of subduction-related material. Not only does the former location provide 
the best opportunity to assess the mean composition of the old AOC (~170 Myr), 
but because these locations are in front of the Mariana arc, the composition of the 
overlying sediments are specifically appropriate as the endmember for the Mariana 
arc lavas. 

Typical assemblages for the deeper ocean sediment package that are subducted 
include volcaniclastics, pelagic clays, cherts, carbonates and Fe-Mn crusts. The U 
concentrations for these materials are variable in the 0.1-10 pg g_' range. Modern 
seawater has a U concentration of ~3.2ng g' and a homogeneous 87*°U of 
—0.390%0 + 0.010%o0 (Supplementary Table 1). Biogenic carbonate appears to 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


incorporate U from seawater without any significant isotope fractionation”. The 
measured deep-sea pelagic clays and volcaniclastics are close to the seawater 5*°*°U 
and the bulk Earth value (— 0.42%o to —0.28%o; Supplementary Table 1). Further- 
more, the ODP Site 801 composite sample ‘801SED’, meant to reflect an average of 
the infilling material between the pillow basalts within the basement (comprising 
chert, hydrothermal deposit, calcite and clay minerals) has a 8°8U similar to sea- 
water. Thus, the Mariana and indeed most subducting sediment packages have an 
average 5°*°U close to the values for modern seawater and bulk Earth. 

The most important source of ‘U excess’ in subduction zones is the AOC. The 
fluid-induced alteration in oceanic crust can generally be classified into high- 
temperature (>100 °C) and low-temperature (<100 °C) types. 

The high-temperature alteration generally occurs close to the spreading ridge 
axis and at greater depth in the crust by percolation of hot hydrothermal fluids®*®. 
Any seawater-derived U uptake in these settings is assumed to be quantitative (see, 
for example, ref. 66); however, the deeper sections of the crust (> 1,000 m) affected 
by high-temperature hydrothermal circulation are generally little altered and have 
low U concentrations close to typical MORB® (for example 0.07 jig g_'). Thus, the 
high-temperature alteration at greater depth does not appear to add a significant 
amount of U compared with the shallower, low-temperature alteration”. 

The low-temperature alteration (<100 °C) dominates at ridge-flanks with per- 

colation of less intensely heated seawater®*’, and the uppermost 500 to 1,000 m 
of the mafic oceanic crust experiences significant U addition*** > with a mean 
fivefold-greater U content relative to the unaltered MORBs’*. The low-temperature 
alteration is therefore the cause of most uptake of additional U in the subducting 
plate. Thus, we have focused our attention on characterizing this low-temperature 
alteration using average, ‘composite’ samples (see below). 
Altered, mafic oceanic crust. For estimating the U concentration and &°*°U budget 
of altered oceanic crust, we have made high-precision 5”**U analyses of ‘compo- 
sites’ from the upper ~500 m ofaltered extrusive lavas at the well-studied ODP Site 
801. This represents a substantial section of the extrusive lavas erupted at a fast 
spreading centre. Secondary alteration products from hydrothermal seawater 
flow-through suggest alteration temperatures from 10 to 100°C, increasing with 
depth. A typical alteration sequence consists of oxic celadonite formation around 
alteration veins, followed by Fe-hydroxides and then reducing saponite and pyrite, 
in a zone moving away from the alteration veins and into the host rock®. Carbo- 
nate precipitates also occur, which may have formed intermittently through time*”. 
Uranium enrichments are evident in breccia zones and in relation to redox haloes, 
with U concentrated at the boundary between oxidized (celadonite-rich) and 
reduced (saponite/pyrite-rich) zones moving away from the alteration veins, in 
a roll-front-redox-type U deposition pattern*”. These redox haloes dominate the 
deeper part of the drilled section®™. In ref. 8 it is estimated that about 50% of the 
total U excess is hosted in the secondaryily formed carbonates and that the remain- 
der is associated with the redox haloes. 

From the main (tholeiitic) ~420 m alteration zone, three suites of composite 
samples from different depth ranges (0-110 m, 110-220 m, 220-420 m) have been 
prepared to average the composition of the heterogeneously altered sections in the 
crust”. The composites are physical mixtures of powders in relative proportions of 
their abundances throughout the particular section of core, and are intended to phy- 
sically represent the bulk composition of various depth domains within the drilled 
sequence. For each of the three composite zones three different powder mixtures 
were prepared: ‘FLO’ composites represent the least-altered material, “VCL’ the 
most altered material and ‘MORB’ composites represent the bulk (mixtures of 
the FLO and VCL composites)”. Furthermore, a “supercomposite’, comprising an 
integration of the full upper 420 m of core, was made”. All the composite samples 
have low Th/U ratios (0.1-0.6), and high U concentrations of ~0.4 1g g (ref. 25). 
The U concentration of the 801 supercomposite (0.39 1g g *) is similar to the 
DSDP 417/418 supercomposite (0.3 gg *) and significantly higher than estimated 
unaltered MORB® (0.05 jig g *). 

The 8°°U was measured in the three composite sections (in all three FLO, VCL 
and MORB powder mixtures), the supercomposite and three individual samples. 
The 5***U in the composite samples are variable through the ~420 m sequence: the 
upper ~110 m averages —0.436%0 + 0.042%o; the middle ~ 110 m are significantly 
heavier, averaging +0.164%o + 0.086%o; and the lower ~200 m are in between, 
averaging —0.145%o + 0.045% (Supplementary Table 1). The supercomposite sam- 
ple yielded a 8°*°U of —0.170%o + 0.026%o. In addition to the composite samples 
we analysed three single samples from different depths: (1) a capping alkali basalt 
(—0.333%0 + 0.044%o) from ~26 m above the ‘start composite depth’ of the altered 
crust section; (2) an altered MORB (—0.341%0 + 0.044%bo) in the upper composite 
section (~—100 m); and (3) a calcitic breccia (—0.114%o + 0.044%o) from the lowest 
composite zone (~—320 m). The differences in 5”**U between the latter two, nor- 
mal, individual altered oceanic crust samples are reassuringly consistent with the 
composites, with the shallower sample showing significantly lower 5*°*U than the 
deeper one. The alkali basalt sample has a high U content (0.7 p.p.m.), presumably 


reflecting its primary composition, which will be much less influenced by second- 
ary U addition than MORB. Alkaline volcanism is atypical of oceanic crust stra- 
tigraphy but is a feature of some West Pacific drill sites, believed to be part of the 
burst of plume volcanism in the Cretaceous”. Fittingly, this alkali basalt sample 
has a 8°*°U similar to other OIB (Supplementary Table 1). 

The variable 57°U, at values distinct from seawater, shows that the seawater- 
derived U is not quantitatively incorporated during alteration in the ODP Site 801 
AOC. During U uptake involving no redox transition, U isotopes generally appear 
to yield similar or slightly lower 5°°5U values'®'”%°, However, during the U(v1) 
to U(1v) reduction process U isotope fractionation is governed by both the nuclear 
field shift and mass-dependent mechanisms’. These processes lead to U isotope 
fractionation, but in opposite directions, with the nuclear field shift dominating 
the total observed ***U/?*°U fractionation and leading to a preference for the heavy 
isotope in the reduced immobile U(iv) form'*!*’"”. Such shifts towards higher 
&8y, during redox-driven U uptake, have been documented in natural environ- 
ments including U-enriched reducing sediments'’*”? and redox-driven roll-front 
U ore deposits’””*”*. With mass balance considerations in mind, it is clear that the 
U incorporation constitutes a partial reduction process, because a complete reduc- 
tion of the available U would result in no net isotopic fractionation. This implies a 
process in which U is partitioned between U(v1) and U(iv) species but only the 
latter is fixed, and left immobile, with the former being transported away in the per- 
colating fluid. Such a loss of isotopically light U during a partial U reduction pro- 
cess preferentially taking up heavy U isotopes has been shown in groundwaters 
associated with roll-front U ore deposits”, in situ bio-stimulated U reduction flow- 
through experiments” and the anoxic Black Sea water column”. 

For the AOC at ODP Site 801, the 57°°U lower than the seawater composition in 
the upper 100 m may be expected from a dominant oxic U uptake, through adsorp- 
tion, consistent with relatively oxidized conditions and high water/rock ratios”. A 
change to higher 5°**U in the lower part of the AOC is in accordance with general 
U addition through a reductive process and a U(v1)-to-U(1v) transition in the deeper 
part of the AOC with more restricted seawater flow-through”. Furthermore, the 
loss of isotopically light U to the upper part of the crust will mean that fluids per- 
colating deeper may anyway have higher 5”*°U signatures than the seawater com- 
position. Both the lower and the middle part of the altered mafic crust have 8°*°U 
higher than seawater, with the highest 8°°°U found in the middle part. This obser- 
vation may be related to the basement structure at ODP Site 801, with variable 
permeability and, hence, through-flow of seawater and, consequently, hetero- 
geneous U addition throughout the AOC. From the deposition of brown oxic 
haloes throughout the ODP Site 801 AOC, most oxic seawater through-flow has 
been estimated to occur in the upper 150 m and below 300 m depth in the core®, 
and, consequently, the most reducing conditions are in between. This may explain 
why the highest 8”*°U is found in the middle part, as U incorporated through U 
reduction is more dominant in this zone, compared to U uptake from oxic adsorp- 
tion in the upper and lower sections. 

Strikingly, the measured 8°*°U and the U concentrations are very similar for the 
least-altered (FLO) and most-altered (VCL) material in each of the composite sections. 
This suggests that it is not the degree of alteration that dictates the U incorpora- 
tion, but the ambient conditions. The relatively high 5”°°U of the supercomposite 
suggests that the reduced U in the deeper part of the AOC dominates the overall 
3°*°U signature. 

Assuming that such roll-front redox U uptake, as seen in the ODP Site 801 AOC, 

is representative of modern AOC U uptake and represents the integrated modern 
AOC for subduction, it delivers a high 3**°U to the mantle. In more reduced con- 
ditions U is expected to be taken up more quantitatively, resulting in little net isotopic 
fractionation of the added U. Such a scenario may be expected to describe the alter- 
ation of mafic oceanic crust from the percolation of anoxic seawater which dominated 
the deeper ocean before the second rise in atmospheric oxygen ~600 Myr ago'*””. 
This scenario would suggest insignificant U isotopic fractionation for U uptake 
into AOC before ~600 Myr ago, yielding a 8°*°U similar to the mean 8°°U 
composition of rivers, the major U input into the ocean. At present, the best esti- 
mate of the modern riverine 5°*°U flux to the ocean is —0.24%o (ref. 78), close to 
our bulk Earth estimate. This suggests that U is released near-congruently with 
little net U isotope fractionation during oxidative terrestrial weathering and riv- 
erine transport. Assuming near-congruent U release during terrestrial weathering 
since the GOE ~2.4 Gyr ago, and oxidation of the atmosphere, quantitative uptake 
of U into the AOC would then imply a 8°*U composition near bulk Earth in the 
period ~2.4 Gyr to ~600 Myr ago. 
Th-U-Pb systematics of OIB. The database. Our simple model of U recycling 
predicts that samples derived from increasingly young mantle sources will have 
increasingly subchondritic Th/U. To test this prediction we compiled data from the 
literature for samples, which have been analysed for both Th and U concentrations 
and Pb isotope compositions (Extended Data Table 1). The latter provide model age 
constraints, as detailed below. 
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To minimize the effects of analytical problems and secondary weathering pro- 
cesses obscuring primary signatures, we placed quite selective criteria for inclusion 
of samples into the data set, as follows. 

(1) We included only samples with mass-spectrometric isotope dilution data on 
Thand U concentrations, coupled with U-series disequilibrium data. This ensures 
high-precision Th/U data and provides additional information on the magnitude 
of possible perturbation by the melting process (see below). Moreover, because 
samples collected for U-series data are all young, this also guards against U pertur- 
bation during weathering. As discussed above, mobility of U during weathering is 
otherwise a significant concern. 

(2) We have selected only data from the main, shielding-building phases of 
islands. Later, post-erosional lavas are frequently invoked to contain a lithospheric 
component”, and thus do not reflect the deep source we seek to investigate. 

(3) In cases where several data sets exist for the sample location, we select the one 
containing the techniques more likely to be robust. Thus, in the case of the Azores, 
we use the data from ref. 81 rather than ref. 82, because the MC-ICPMS Pb data of 
the former provide much less-scattered model ages than those obtained from TIMS 
measurements of the latter. 

Our database (Extended Data Table 1) thus comprises analyses from Hawaii, 
Iceland, Canary Islands (La Palma), Azores (Pico and Sao Miguel), Society Islands 
and Samoa. We also include a composite data point for Reunion derived from sepa- 
rate U-series and Pb isotope studies of historic eruptions. Although these studies 
are dominantly on different sample suites, the well-documented, extreme isotopic 
homogeneity of historic Reunion magmatism* gives us the confidence to combine 
the mean values of Th/U and Pb isotopes. There is one island (Pitcairn) for which 
appropriate data exists according to our criteria, but which we have not plotted for 
the following practical reasons. The very unradiogenic Pb isotope ratios of this 
island yields negative model ages in our calculations and so cannot be plotted to- 
gether with the other data. This, combined with their extremely high Th/U, sug- 
gests that additional processes are responsible for the striking characteristics of this 
EMI-type composition, for example the erosion of deep continental crust**. In all, 
our compiled OIB database covers a similarly wide range of isotopic character- 
istics as represented by samples analysed for 5”**U (Fig. 1), and so forms a fitting 
complement. 

In Fig. 3 we plot each individual datum from our compilation as a point, to show 
the range of compositions. To emphasize the contrasting mean compositions of 
different islands we have also averaged individual samples from a given island. A 
comment is required about the averaging of the Azores samples. The island of Sao 
Miguel has a marked spatial isotopic heterogeneity, with a distinct geographic (west- 
to-east) variation. Thus, we have added samples from the western volcanic centre 
(Sete Cidades) to the Pico samples to represent ‘normal’ Azores (I) while the other, 
more easterly samples are averaged to give ‘enriched’ Azores (II). 

Pb model ages. It has long been known that a slope on the plot of *°°Pb/?™Pb 
versus *°’Pb/?™*Pb potentially has age significance (see, for example, ref. 85). This 
approach was used to some effect in ref. 10, using the linear arrays in *°°Pb/?™Pb 
versus ~’’Pb/?™Pb defined by some OIBs to calculate isochron ages of their sources, 
which ranged from 1 to 2.5 Gyr. Here we follow a similar approach. However, we did 
not want to rely on islands yielding well-defined linear arrays in Pb isotope space. 
Instead, we calculate the model ages of individual points rather than the slope of an 
array of data. Both approaches assume a common first stage for all samples. From 
this evolving reservoir a secondary model age (tm) and U/Pb (12) are calculated to 
produce the modern Pb isotopic composition. The parameters of our first-stage 
evolution are given in Extended Data Table 2 together with other input values. 

The Pb model age we calculate represents an event that increased U/Pb to gen- 
erate modern Pb isotopic compositions that lie to the right of the geochron. As 
discussed widely (see, for example, ref. 86), the process of subduction provides an 
appealing physical manifestation of this model scenario. During subduction, dehy- 
dration preferentially removes Pb from the mafic crust*’, increasing the U/Pb and 
Th/Pb of the deep subducted residue. Thus, we believe that the model ages relate to 
the time of subduction of recycled oceanic crust found in OIB sources. 

Explicitly, we calculate our model ages by rearranging and numerically solving 
the following two equations (1) and (2) for 2 and ty (using parameters described 
in Extended Data Table 2): 


206 Pb 206 Pb 


he ogg tm osstm 
4PH 20 Pbop Fhe ei (e—1) (1) 
207 ph 207 ph 
Hy ( elasst chaste) Hy ( ehasstn 1) (2) 
204ph 204Phqy | 137.88 137.88 


Having obtained jy and ty a model Th/U (weight ratio) of the second stage may 
then be calculated accordingly by rearranging equation (3): 
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Extended Data Table 1 contains the averaged, calculated model Pb ages for our OIB 
data set, whereas the individual model Pb ages versus measured Th/U and modelled 
Th/U (Pb, two-stage) are shown in Fig. 3. We use Th/U (weight ratio) throughout 
as this is most commonly reported in the literature, although *’Th/?*°U (atomic 
ratio, «) is required in the calculations (the difference between these two ratios is 
not great). The measured Th/U is potentially perturbed by melting or during melt 
migration to the surface, or both. For this reason, we used only samples for which 
U-series measurements were available, which provides direct constraints on the 
magnitude of this process. All samples in our data set have **°Th excesses, from 
1% to 37% (Extended Data Fig. 3). This implies no more than a 37% increase in Th/ 
U during melt generation and likely less (see, for example, the discussion in ref. 88). 
As in the case of MORB, recent melting cannot explain the trend to lower Th/U 
from values close to the planetary reference (3.876). 

The Samoan samples are notable for having Th/U higher (4.0-5.3) than the 
planetary reference value. This cannot be solely a result of recent melt fractionation 
because these samples have minor °Th/?7*U) disequilibrium (Extended Data 
Fig. 3). This high Th/U is potentially associated with lithospheric enrichment from 
plume-derived carbonatitic metasomatism, which can fractionate Th/U but will 
not influence the Pb isotopes”. We note that the Samoan Pb isotopes are incom- 
patible with their high Th/U being associated with ancient fractionation; that is, 
they have model Th/U within error of the planetary Th/U. Although the Samoan 
source has long been associated with recycled continental sediments”, if these were 
recycled before the major rise in atmospheric oxygen (as is compatible with their 
model ages), then the Th/U of the continental material should be unfractionated”’””. 
Consequently, whether or not the enriched component is continental is not a critical 
issue. We also stress that the overall trend in Fig. 3 is not pinned by Samoa, but 
includes Réunion and the enriched samples of Sao Miguel. For the latter, there has 
been a detailed discussion of why recycled sediment is not implicated in this source”*. 

The apparently continuously declining Th/U of OIBs and their typically higher 
values than MORB argues against excess U left residual in the subducting slab hav- 
ing a significant role in lowering Th/U of OIBs. Rather, we infer that the slab adds 
its excess U, from sea-floor alteration, to the upper mantle. As hypothesized in the 
main text this is likely to result from its mineralogical host becoming unstable 
during pro-grade metamorphism. A host such as allanite”” would survive beyond 
the subduction zone, but would ultimately melt to transfer U into the surrounding 
mantle. Thus, we infer that the declining Th/U of OIBs reflects the steadily decreas- 
ing Th/U of the upper-mantle source, which forms crust subsequently recycled to 
produce further OIBs. In this model, the upper mantle always has lower Th/U than 
previously formed OIB sources. The need for the excess U to be lost from recycled 
oceanic crust has also been discussed in terms of the Pb isotope systematics of OIB 
(see, for example, refs 93, 94). 
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Extended Data Figure 1 | 5”°*U reproducibility of standards. Repeated 
3”°5U measurements of a range of standards with different matrixes (CZ-1 
uraninite, BHVO-2/LP 45 E basalts, seawater) are shown. All have external 
reproducibility (2 s.d., grey shaded area) better than +0.30%o, a similar range to 
the internal measurement uncertainty (2 s.e.) for individual samples 
(Methods). The different symbols refer to the different measurement set-ups 
(Supplementary Table 4). 
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Extended Data Figure 2 | U-Th geochemistry of analysed meteorites. 

a, 8°8U versus U concentration for ordinary chondrites (black diamonds, 
‘finds’; red diamonds, ‘falls’). b, 87°°U versus (7°4U/?**U) for ordinary 
chondrites (symbols as in a) and eucrites (blue circles). ¢, &?°U versus Th/U for 


the same samples as in a and b. d, A ‘Caltech plot’ of the 8°°°U of individual 
meteorite samples and averages based on (1) the only two meteorites with 
(34U/?8U) within error of secular equilibrium (‘Mean (Z+J)’) and (2) all of 
the analysed meteorites (‘Mean all’). Error bars denote 2 s.e.m. 
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Extended Data Figure 3 | U-Th isotope systematics in the OIB used for Pb —_ (8) Réunion. References can be found in Extended Data Table 1. Note that the 

age modelling. Symbol colours are as in Fig. 3: (1) Hawaii, (2) Iceland, y axis shows activity ratio whereas the x axis shows a weight ratio. The dashed 

(3) Azores I, (4) La Palma, (5) French Polynesia, (6) Samoa, (7) Azores II, line represents secular equilibrium of (°Th/?38U). 
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Extended Data Table 1 | Literature compilation of Pb, U and Th in Ocean Island Basalts 


N* Locality Ref. Pb Ref. Th/U nt age (Ga)+ 
1. Hawaii (Kilaeua) (95,96,97) (95,96,97) 3 1.76 
2; Iceland (98) (99) 35 1.81 
3. Azores | (Sao Miguel/Pico) (81) (100,101) 10 1.89 
4. Canaries (La Palma) (102,103) (102) 8 1.92 
5. French Polynesia (104) (104) 11 242 
6. Samoa (105) (105) 13 2.25 
ra Azores II (Sao Miguel) (81) (100) 13 2.33 
8. Réunion (106) (107) (average) 2.42 


‘Locality number used in main text Figure 3 
tNumber of individual data-points 


+ Average Pb model ages (tn) for each locality, see methods 
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Extended Data Table 2 | Input parameters for calculating Pb model ages 


Initial composition*: 


2°SBb/*™*Pb (Canyon Diablo): 
?°7Bb/**Pb (Canyon Diablo): 


2°8Bb/**Pb (Canyon Diablo): 


uy (1. Stage 8u?™ Pb): 
Th/U, (1. Stage): 
t (time ago, Ga): 


238 /235Ly- 


Decay constants (y"): 
eae (**U): 
Deas ("°U): 


dasa (Th): 


9.3066 
10.293 
29.475 
7.85 
3.876 


4.57 


137.88 


1.03326£ 


1.551E-10 
9.849E-10 


4.948E-11 


* Radioactive element abundances and ratios are reported as present day values 


+The “old consensus value” is used to be comparable with literature data 


+k is the conversion factor of Th/U weight ratios to atomic ratios of *’Th/**U (or kappa). We use Th/U (weight ratios) throughout the text for consistency with most 


literature 


©2015 Macmillan Publishers Limited. All rights reserved 


| sd Wal Be 


doi:10.1038/nature13864 


Promoterless gene targeting without nucleases 
ameliorates haemophilia B in mice 


A. Barzel', N. K. Paulk!, Y. Shi?, Y. Huang’, K. Chu', F. Zhang", P. N. Valdmanis!, L. P. Spector, M. H. Porteus®, K. M. Gaensler” 


& M. A. Kay! 


Site-specific gene addition can allow stable transgene expression for 
gene therapy. When possible, this is preferred over the use of pro- 
miscuously integrating vectors, which are sometimes associated with 
clonal expansion’ and oncogenesis’. Site-specific endonucleases that 
can induce high rates of targeted genome editing are finding increas- 
ing applications in biological discovery and gene therapy’. However, 
two safety concerns persist: endonuclease-associated adverse effects, 
both on-target* and off-target”*; and oncogene activation caused by 
promoter integration, even without nucleases’. Here we perform re- 
combinant adeno-associated virus (rAAV)-mediated promoterless 
gene targeting without nucleases and demonstrate amelioration of 
the bleeding diathesis in haemophilia B mice. In particular, we tar- 
get a promoterless human coagulation factor IX (F9) gene to the liver- 
expressed mouse albumin (AIb) locus. F9 is targeted, along with a 
preceding 2A-peptide coding sequence, to be integrated just upstream 
to the Alb stop codon. While F9is fused to Alb at the DNA and RNA 
levels, two separate proteins are synthesized by way of ribosomal skip- 
ping. Thus, F9 expression is linked to robust hepatic albumin ex- 
pression without disrupting it. We injected an AAV8-F9 vector into 
neonatal and adult mice and achieved on-target integration into 
~0.5% of the albumin alleles in hepatocytes. We established that F9 
was produced only from on-target integration, and ribosomal skip- 
ping was highly efficient. Stable F9 plasma levels at 7-20% of normal 
were obtained, and treated F9-deficient mice had normal coagula- 
tion times. In conclusion, transgene integration as a 2A-fusion to a 
highly expressed endogenous gene may obviate the requirement for 
nucleases and/or vector-borne promoters. This method may allow 
for safe and efficacious gene targeting in both infants and adults by 
greatly diminishing off-target effects while still providing therapeutic 
levels of expression from integration. 

Site-specific gene targeting is one of the fastest growing fields in gene 
therapy and genome engineering. The rise in popularity of gene target- 
ing can be attributed in large part to the development of readily engi- 
neered and easy to use site-specific endonucleases (for example, TAL- or 
CRISPR-based)’ that can increase rates of gene disruption, gene correc- 
tion or gene addition by as much as four orders of magnitude. However, 
these endonucleases may have significant adverse effects including im- 
munogenicity, uncontrolled DNA damage response, off-target cleavage 
and mutagenesis, induction of chromosomal aberrations, as well as off- 
target integration of the transgene and endonuclease vectors (if DNA- 
based)**. When a vector-borne promoter, driving expression of the 
therapeutic transgene or and/or the nuclease, is integrated either on- or 
off-target, it may lead to undesired activation of nearby genes, includ- 
ing oncogenes. The use of endonucleases in vivo would require their 
vectorization, delivery and expression in a transient manner to minimize 
long-term side effects. It is unclear how integration of the vectored en- 
donuclease gene could be strictly avoided. 

Our promoterless, endonuclease-independent method harnesses the 
efficient transduction, favourable safety profile and high gene target- 
ing rates associated with rAAV*”, as well as the robust liver-specific 


expression of the Alb locus’’. Different rAAV serotypes can efficiently 
transduce various cell types in vitro or in vivo, while other serotypes 
have been designed or selected for desired phenotypes'*"’. rAAV is in 
use in several clinical trials'*”°. Notably, rAAV transduction allows high 
gene targeting rates in vitro® and in vivo’”°. The increased recombination 
rates may be due to the viral inverted terminal repeats, the encapsida- 
tion of single-stranded DNA, or the timing and subcellular localization 
of capsid uncoating. 

The safety of rAAV stems from its lack of pathogenicity, as well as 
being devoid of viral genes. Nevertheless, non-targeted genomic integ- 
ration of rAAV occurs at a low but notable rate, and there are reports of 
such integrations inducing hepatocellular carcinoma in mice’. Trans- 
formation was attributed to vector integration at a chromosome 12 locus 
coding for imprinted genes and small RNAs. Integration of rAAV- 
borne promoters may be the leading cause of aberrant expression, as 
was established for lentiviral and retroviral vector integration leading 
to clonal expansion and oncogenesis’”'. Vector-borne promoters are 
in use in many continuing clinical trials. By contrast, the rAAV used in 
our strategy encodes no promoter, thus diminishing the chance of neigh- 
bouring oncogene activation in rare off-target integrations. 

As proof of concept, we targeted the human F9 gene, deficient in the 
X-linked recessive disease haemophilia B, which affects 1 in 30,000 males. 
Affected individuals suffer from serious spontaneous bleeding owing 
to a deficiency of plasma coagulation F9 produced from the liver. Re- 
constitution with as little as 1-2% clotting factor can considerably im- 
prove quality of life, while 5-20% will markedly ameliorate the bleeding 
diathesis. Here we used the liver tropic rAAV8 serotype to target human 
F9 for expression after integration from the robust liver-specific mouse 
Alb promoter. We postulated that: the Alb promoter should allow high 
levels of coagulation factor production even if integration takes place in 
only a small fraction of hepatocytes; and the high transcriptional activ- 
ity at the Alb locus might make it more susceptible to transgene integ- 
ration by homologous recombination. 

Gene targeting without nucleases should affect only a small fraction 
of Alb alleles in the liver. Nevertheless, we opted to minimize disrup- 
tion and dysregulation of the Alb gene by targeting F9 as a 2A-fusion at 
the end of the Alb reading frame (Fig. 1a). 2A-peptides, derived from 
plus-strand RNA viruses, allow the production of several proteins from 
a single reading frame by means of ribosomal skipping”. This process 
leaves the first translated protein tagged with ~20 carboxy-terminal ami- 
no acids, and the second protein with just one additional amino-terminal 
proline. Functionality of both proteins is typically retained, and clinical 
trials using 2A-peptides did not report immunogenicity”. We used 
single-stranded AAV to target a codon-optimized F9 coding sequence, 
preceded by a sequence coding for a porcine teschovirus-1 2A-peptide 
(P2A)”, to be integrated just 5’ of the Alb stop codon. After integra- 
tion, Alb and F9 are co-transcribed from the strong Alb promoter, and 
should thus be co-regulated at the levels of splicing, nuclear exit, mes- 
senger RNA stability, translation initiation and endoplasmic reticulum 
localization. Two separate proteins are translated, both containing a signal 
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peptide, so that the endoplasmic reticulum-associated translation of 
Alb will be immediately followed by translation and processing of the 
clotting factor for secretion. Finally, to reduce the chance of off-target 
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Figure 2 | Human F9 expression and activity in injected mice. a, Plasma 
F9 measured by ELISA following intraperitoneal injections of 2-day-old B6 
mice with 2.5 X 10’! vector genomes per mouse of either the AAV8-F9 
experimental construct (n = 6) or inverse control (n = 3). The limit of 
detection was 20 ng ml '. PH, partial hepatectomy. Error bars represent s.d. 
Dashed lines denote 5% and 20% of normal F9 levels. b, Plasma F9 measured by 
ELISA after tail vein injections of 9-week-old female B6 mice with 1 x 10’ 
vector genomes per mouse of the AAV8-F9 experimental construct (n = 7), 
or inverse control (n = 3), or a hydrodynamic injection of 30 ug plasmid 

(3.5 X 10!” copy number) coding for the F9 construct in the ‘correct’ 
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Figure 1 | Vector design and experimental scheme. a, The rAAV8 vector 
encodes a codon-optimized human F9 cDNA preceded by a 2A-peptide 
coding sequence and flanked by homology arms spanning the mouse Alb stop 
codon. Length of the 5’ and 3’ arms are 1.3 and 1.4kb, respectively. After 
integration by homologous recombination, Alb and F9 are fused at the DNA 
and RNA levels, but two separate proteins are produced as the result of 
ribosomal skipping. b, With respect to the Alb homology arms, the AAV 
inverse control has F9 inverted along with the 2A-peptide coding sequence, the 
adjacent Alb exon and the preceding splice junction. Thin white lines denote 
Alb introns; dark grey boxes denote Alb exons; white boxes denote P2A; 
white arrows denote F9 transgene; light grey boxes denote extragenic DNA. 
P, proline. 


F9 expression further, our vector has neither an ATG start codon before 
the F9 signal peptide, nor a start codon in the 2A-peptide coding sequence 
or preceding Alb exon. 

First, we performed intraperitoneal injections of 2-day-old C57BL/6 
(B6) mice with 2.5 X 10’! vector genomes per mouse (~1.25 X 10°* 
per kg) ofarAAV8 coding for the human F9 targeting cassette or a vector 
with an inverted cassette, controlling for off-target expression (Fig. 1b). 
The fragment inverted in the control with respect to the Alb homology 
arms includes not only the F9 gene, but also the P2A coding sequence, 
the adjacent Alb exon, and the preceding splice junction. The inverse 
control should not allow notable F9 expression after on-target integ- 
ration, but would allow levels of off-target expression similar to that 
from the experimental construct. We measured plasma F9 protein levels 
each week by enzyme-linked immunosorbent assay (ELISA), starting 
at 4 weeks of life (Fig. 2a). For the experimental group, levels of plasma 
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orientation (n = 3). The limit of detection was 20 ng ml !. Error bars and 
dashed lines as in a. c, Plasma F9 measured by ELISA following tail vein 
injections of 9-week-old female B6 mice with the designated vector dose of 
AAV8-F9 experimental construct (n = 4 for each dose group). Error bars 
represent s.d. d, Measurement of coagulation efficiency by activated partial 
thromboplastin time (aPTT) 2 weeks after tail vein injections of AAV8-F9 at 
1X 10 vector genomes per mouse (n = 5). KO, knockout. Error bars 
represent s.d. e, Western blot analysis for F9 in liver samples from mice injected 
with the AA V8-F9 construct or inverse control. The expected size of human F9 
is 55 kilodaltons (kDa). 
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F9 plateaued at 350-1,000 ng ml ~ 1 which corresponds to 7-20% of nor- 
mal. For the inverse control group, F9 plasma levels were at or below 
the level of detection (20 ng ml), suggesting that in the experimental 
group, F9 expression does indeed originate from on-target integration. 
Importantly, F9 retains the original plasma protein levels after a two- 
thirds partial hepatectomy, a surgical procedure known to reduce epi- 
somal AAV transgene expression by >90%%, further establishing stable 
transgene integration. 

To determine whether liver growth, as seen with neonates, is essen- 
tial for therapeutic levels of gene targeting, we targeted F9 to the Alb locus 
using the same vector in adult mice. Adult B6 mice were injected with 
1X 10’ vector genomes per mouse (~5 X 10! vector genomes per kilo- 
gram) by tail vein with the AAV8-F9 vector, or the inverse control. A third 
group of control mice received hydrodynamic tail vain injections of a 
plasmid coding for the promoterless F9 construct in the ‘correct’ ori- 
entation. For the AAV8-F9 mice group, plasma F9 levels were found to 
be stable at 7-20% of normal (Fig. 2b). Vector injections at lower dose 
led to lower plasma F9 levels without reaching a plateau at the doses 
tested (Fig. 2c). For adults as well as neonates, the F9 plasma levels of 
the inverse control group were at or below the limit of detection. Dimin- 
ished F9 plasma levels were also associated with mice hydrodynamically 
injected with plasmid (Fig. 2b). Thus, targeting is dependent on rAAV 
vectorization. Finally, we performed rAAV injections in adult F9 knock- 
out haemophilia B mice. The functional coagulation, as determined by 
the activated partial thromboplastin time in treated knockout mice, was 
restored to levels similar to that of wild-type mice (Fig. 2d). The F9 bio- 
logical activity correlated with plasma protein levels of 709 + 91 ng ml’, 
similar to levels in wild-type mice (Fig. 2b-d). 

F9 expression from the liver was confirmed by immunohistochemistry 
(Extended Data Fig. 1). Western blot analysis of liver samples detected 
F9 at the expected molecular mass, testifying that ribosomal skipping 
was efficient, and suggesting that both the ELISA and immunohisto- 
chemistry signals correspond to accurately processed F9 (Fig. 2d). 

We opted to quantify targeting rates by quantitative PCR (qPCR). To 
avoid false signals from episomal rAAV, we first amplified a 3’ seg- 
ment of the genomic Alb locus in a manner not affected by presence or 
absence ofan integrated F9 sequence (Fig. 3a and Extended Data Fig. 2). 
The unbiased amplification was made possible by presence of a com- 
mon restriction site at a roughly equal distance 3’ of the stop codon in 
targeted and wild-type alleles. We then used the PCR amplicon as a 
template for two different qPCR assays: one quantifying the abundance 
of targeted Alb alleles, and the other quantifying the abundance of un- 
targeted wild-type alleles. In the liver, only hepatocytes are targeted by 
rAAV8 (ref. 24). Therefore, we conservatively corrected for a 70% he- 
patocyte frequency” and found the rate of Alb alleles targeted by F9 to 
be 0.5% on average for mice injected as either neonates or adults at the 
highest dose (Fig. 3c and Extended Data Fig. 3). Actual rates of target- 
ing in neonates and adults might differ because AAV distribution to the 
liver may vary based on the different methods used for vector infusion. 
We then examined the proportion of fused Alb-F9 mRNAs to wild-type 
Alb mRNAs by comparing two respective qPCR assays performed on 
an unbiased cDNA template (Fig. 3b). The proportion was found to be 
0.1% on average for mice injected as either neonates or adults (Fig. 3c). 
This value tended to be lower than the rate of integration at the DNA 
level, although the difference was not statistically significant. It is pos- 
sible that the production, processing and/or stability of chimaeric Alb- 
F9 mRNA transcripts were reduced compared to wild-type Alb mRNA. 
While AAV8 has been shown to target only hepatocytes in the mouse 
liver”, here we did not rule out the possibility that some integration 
occurred in non-parenchymal cells that do not express albumin. Our 
observed targeting rate is higher than other reports”'*”, and is parti- 
cularly noteworthy in adult mice in which non-proliferating hepato- 
cytes were expected to allow for a low rate of homologous recombination. 
We propose that the high expression rate at the Alb locus and the as- 
sociated chromatin status may contribute to the high rates of target- 
ing. Damage-induced proliferation cannot be strictly ruled out, but no 
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Figure 3 | Rate of Alb targeting at the DNA and RNA levels. a, Assessment of 
on-target integration rate begins using linear amplification with biotinylated 
primer 1 (black), annealing to the genomic locus but not to the vector. Linear 
amplicons are then bound to streptavidinylated beads and washed to exclude 
episomal vectors. Subsequent second-strand DNA synthesis with random 
primers was followed by CviQI restriction digestion. A compatible linker is 
then ligated, followed by two rounds of nested PCR (primers 2-3 in blue, and 
then primers 4-5 in red). CviQI cleaves at the same distance from the homology 
border in both targeted and wild-type alleles, thus allowing for unbiased 
amplification. The amplicons of the second nested PCR then serve as a template 
for qPCR assays with either primers 6-7 (green) or 8-9 (orange). b, For mRNA 
quantification, primers 10-11 or 11-12 were used to generate a cDNA for 
qPCR assays. Shape and fill code as in Fig. 1. c, Black bars represent the targeting 
rate of Alb alleles as the ratio between the abundance of the DNA template 
amplified by primers 6-7 to the abundance of the DNA template amplified by 
primers 8-9, corrected by a factor of 0.7 to account for hepatocyte frequency. 
Grey bars represent the expression rate of targeted Alb alleles as the ratio 
between the abundance of the cDNA template amplified by primers 10-11 to 
the abundance of the cDNA template amplified by primers 11-12. n = 3 for 
each group, error bars represent s.d. 


increase in alanine transaminase (ALT) levels was seen after injection 
(Extended Data Fig. 4). 

AAV genomes may be present in cells as episomes, or as on- or off- 
target integrants. Total vector copy number was assessed by qPCR (Ex- 
tended Data Fig. 5). The minor change in vector copy number after 
partial hepatectomy in mice injected as neonates may suggest that epi- 
somal vectors had already been greatly diluted during normal liver growth 
and development. In which case, vector copy number can be used as an 
approximate lower bound on the rate of off-target to on-target integ- 
ration. However, most importantly, in the absence of a vector-borne 
promoter, F9 should only be expressed from on-target integration. The 
reconstituted high F9 levels after partial hepatectomy (Fig. 2a) support 
this assumption, as only stably integrated transgenes could rebound after 
such a procedure, unlike that seen with transient episomal expression’. 
Lack of notable F9 plasma levels after treatment with the inverse con- 
trol vector further demonstrated reduced off-target expression. We used 
quantitative reverse transcription PCR (qRT-PCR) to assess the ratio of 
fused Alb-F9 mRNAs directly among the total F9 mRNA pool (Fig. 4a). 
The ratio was found to be 1:1 for mice injected as neonates and as adults 
(Fig. 4b). This suggests that F9 is expressed almost exclusively from 


©2015 Macmillan Publishers Limited. All rights reserved 


on-target integration. Indeed, the only specific signal from a northern 
blot with a P2A probe corresponded to the expected fused Alb-P2A-F9 
mRNA (Fig. 4c). Finally, a western blot with an anti-2A-peptide anti- 
body indicated that the 2A-peptide is associated with a single species 
at the expected molecular mass of Alb (Fig. 4d), as would be expected 
only if expression was restricted to on-target integration and followed 
by efficient ribosomal skipping. 

rAAV has become a popular vector for clinical therapy. Although the 
period of transgene expression in adults can last for several years, it is 
not yet known whether lifelong expression, as required for many gen- 
etic disorders, can be obtained with routine promoter-containing vec- 
tors. Episomal expression from AAV vectors is rapidly lost in dividing 
cells, even after just one round of cell division’. This makes it likely that 
diseases that induce regeneration and/or are treated in infancy while 
tissues continue to grow, will have limited durability. Secondary infu- 
sion of an AAV vector will be unlikely to result in a successful trans- 
duction, owing to the robust humoral immunity resulting from primary 
vector administration”’. By contrast, our approach results in vector in- 
tegration that would eliminate loss of expression over time, even in di- 
viding tissues. This relies on the choice of an appropriate AAV serotype 
to avoid neutralization by pre-existing immunity. 

Previous work demonstrating targeting of F9 to a chimaeric locus in 
a transgenic mouse”? relied on the co-expression of nucleases that may 
be associated with immunological and genotoxic side effects. Interest- 
ingly, the same reliance on endonucleases held true even when F8 was 
targeted to the Alb locus in mice and non-human primates”, probably 
because no homology arms were provided and integration relied instead 
on non-homologous end-joining. rAAV has already been used in clin- 
ical gene therapy trials to treat haemophilia B'*. However, the transgene 
in these clinical trials was expressed from a vector-borne promoter that 
might induce oncogene activation, as has been reported in mice’. Mea- 
suring levels of alanine transaminases, we observed no liver toxicity 
with the injection of our human F9 targeting vector (Extended Data 
Fig. 4). However, it remains to be determined whether the transgene 
overexpression associated with our method will lead to toxicity when 
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Figure 4 | Specificity of F9 expression. a, cDNA, produced from reverse 
transcription with a poly-dT primer, served as a template for a qPCR assay with 
either primers 13-14 or 14-15. b, Bars represent the rate of Alb-F9 mRNAs 
to total F9-containing mRNAsas the ratio between the abundance of the cDNA 
template amplified by primers 13-14 to the abundance of the cDNA template 
amplified by primers 14-15. n = 3 for each group, error bars represent s.d. 

c, Northern blot analysis of liver samples with a probe against P2A. The lower 
nonspecific signal corresponds in size to 18S rRNA. d, Western blot analysis 
of P2A from liver samples of mice injected with the AAV8-F9 construct or 
inverse control. P2A is expected to be fused to albumin (66.5 kDa). 
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different therapeutic transgenes are targeted. 2A-induced immunogen- 
icity could not be strictly ruled out, but notably no immune effects were 
reported in clinical trials when vector coding for a 2A peptide was tar- 
geted to lymphocytes”’. Although we found no evidence of off-target 
expression and no ALT increase, the high vector dose we used may lead 
to other undesired outcomes such as increased off-target integration and 
increased immunogenicity. In the future, this could be mitigated by the 
use of AAV serotypes having better tropism and/or by use of hyper- 
active F9 variants. Genetic polymorphisms at the target locus in the 
human patient population may lead to variable therapeutic efficacy owing 
to reduced homology. However, we found that ~95% ofa 1000 Genomes 
Project (http://www. 1000genomes.org) sample of the human population 
have no more than just two haplotypes at the relevant ALB sequence, 
which may enable broad applicability (Extended Data Table 1). Our 
work demonstrates a therapeutic effect for in vivo gene targeting with- 
out nucleases and without a vector-borne promoter. The favourable 
safety profile of our promoterless and nuclease-free gene targeting strat- 
egy for rAAV makes it a prime candidate for clinical assessment in the 
context of haemophilia and other genetic deficiencies”’. More generally, 
this strategy could be applied whenever the therapeutic effect is conveyed 
by a secreted protein (for example, broadly neutralizing antibodies*®) 
or when targeting confers a selective advantage’. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Plasmid construction. A mouse genomic Alb segment (90474003-90476720 in 
NCBI reference sequence: NC_000071.6) was PCR-amplified and inserted between 
AAV? inverted terminal repeats into BsrGI and Spel restriction sites in a modified 
pTRUEF backbone”'. The genomic segment spans 1.3 kilobases (kb) upstream and 
1.4kb downstream to the Alb stop codon. We then inserted into the Bpu10I restric- 
tion site an optimized P2A coding sequence preceded by a linker coding sequence 
(glycine-serine-glycine) and followed by an Nhel restriction site. Finally, we inserted 
acodon-optimized F9 coding sequence into the Nhel site to get pAB269 that served 
in the construction of the rAAV8 vector. To construct the inverse control, we first 
amplified an internal segment from the BsiWI restriction site to the 3’ Nhel restric- 
tion site. PCR primers used had 15 base tails to allow subsequent integration of the 
amplicon into a BsiWI and Nhel cleaved plasmid at the inverse orientation using 
an In-Fusion Kit (Clontech). Primers used were: forward, 5’-ATGCAAGGCACG 
TACGTTATGTCAGCTTGGTCTTTTCTTTGATCC-3’, and reverse, 5’-TTTAG 
GCTAAGCTAGCTTTACTATGTCATTGCCTATGGCTATGAAGTG-3’. Final 
rAAV production plasmids were generated using an EndoFree Plasmid Megaprep 
Kit (Qiagen). 

rAAV vector production and titration. rAAV8 vectors were produced as prev- 
iously described using a Ca3(PO,), transfection protocol followed by CsCl gradient 
purification*’. Vectors were titred by quantitative dot blot as described”’. 

Mice injections and bleeding. Animal work was performed in accordance to the 
guidelines for animal care at both Stanford University and the University of Cali- 
fornia San Francisco. Eight-week-old wild-type C57BL/6 (B6) mice were purchased 
from Jackson Laboratory to serve for adult injections and as breeding pairs to pro- 
duce offspring for neonatal injections. Two-day-old wild-type B6 mice were in- 
jected intraperitonealy with 2.5 X 10'' vector genomes per mouse of rAAV8 (F9 
or inverse) and bled weekly beginning at week 4 of life by retro-orbital bleeding for 
ELISA as previously described’. Adult (9-week-old) wild-type female B6 mice re- 
ceived either tail vein injections of rAAV8 (F9 or inverse, at the designated dose) 
or hydrodynamic injections of 3.5 X 10'? plasmid copies, and were similarly bled 
weekly for ELISA. F9 CD1/B6 hybrid knockout mice were injected as adults with 
1 X 10’ vector genomes per mouse of rAAV8 F9, and bled retro-orbitally two weeks 
after injection for ELISA and activated partial thromboplastin time assays, as pre- 
viously described". 

ELISA. ELISA for F9 was performed as previously described with the following 
antibodies: mouse anti-human F9 IgG primary antibody at 1:1,000 (Sigma F2645), 
and polyclonal goat anti-human F9 peroxidase-conjugated IgG secondary antibody 
at 1:4,200 (Enzyme Research GAFIX-APHRP). 

Partial hepatectomies. The two-thirds partial hepatectomies were performed as 
previously described** with the following minor modifications: no surgical retrac- 
tors were used to maintain an open abdominal cavity, 3-0 Sofsilk wax coated braided 
silk suture (Covidien S-184) was used for knotting desired lobes for resection, and 
6-0 Polysorb polyester suture (Covidien GL889CV 11) was used for closing the peri- 
toneum and abdominal skin. 

Activated partial thromboplastin time assay. The activated partial thromboplas- 
tin time assay was carried out using a SCA2000 veterinary coagulation analyser 
(Synbiotics) according to manufacturer’s instructions". 

Western blot analysis. Western blots for the detection of F9, albumin, P2A and 
actin used the following antibodies: polyclonal goat anti-human F9 peroxidase- 
conjugated IgG primary antibody at 1:20,000 (Enzyme Research GAFIX-APHRP), 
polyclonal rabbit anti-mouse serum albumin IgG primary antibody at 1:40,000 
(Abcam ab19196), donkey anti-rabbit peroxidase-conjugated IgG secondary antibody 
at 1:10,000 (ECL NA-9340), polyclonal rabbit anti-2A peptide primary antibody 
at 1:10,000 (Millipore ABS31), and monoclonal mouse anti-f-actin peroxidase- 
conjugated IgG primary antibody at 1:50,000 (Sigma A3854). 

Northern blot analysis. Northern blots for the detection of P2A-F9 coding mRNAs 
used the following *’P end-labelled anti-2A probe: 5‘-GCCAGGGTTCTCTTCC 
ACGTCGCCGGCCTGTTTCAGCAGGCTGAAATTGGTGGCGCCGCT-3’. 
Assessing rate of Alb locus targeting by qPCR. Amplification of desired genomic 
Alb, but not undesired vector amplification, began by performing linear amplifica- 
tion with the following dual-biotinylated primer: 5’ -/2-biotin/GTCTCTCATTCA 
GAATTCTCGTAATGTTGAAG-3’, annealing outside the arm of homology. Sub- 
sequent second-strand DNA synthesis was followed by CviQI restriction digestion to 
produce fragments of roughly equal size and known sticky ends from both targeted 
and wild-type alleles. Two oligonucleotides were annealed to a CviQI-compatible 
linker. Oligonucleotide sequences were ‘Watson’: 5’-CTGAAGGCTCAGGTTAC 
ACAGGCACGCTCGTAGGAGGTGTTCCAGTTCACCACG-3’, and ‘Crick’: 5'- 
TACGTGGTGAACTGGAACACCTCCTACGAGC/3ddC/-3’. Linker ligation was 
followed by a primary nested PCR, using primers 2: 5'-CTGAAGGCTCAGGT 
TACACAGGCAC-3’, and 3: 5'-GTATTGGTTTCTAGGGTCACCACCCATAA 
G-3’, anda second nested PCR using primers 4: 5’-GCTCGTAGGAGGTGTTCC 
AGTTCACC-3’, and 5: 5'-GGGAGAGTATTAACGTTTATTTTCATTGTGTT 
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G-3'. No nested PCR product was detected in a control without linear amplifica- 
tion. The PCR amplicon served as a template for two different TaqMan qPCR assays. 
To quantify the abundance of wild-type Alb alleles, we used a TaqMan qPCR with 
primers 6: 5'-TGCCTATGGCTATGAAGTGC-3’, and 7: 5'-CTGAGAAGGTT 
GTGGTTGTGA-3’, and TaqMan probe: 5'-TGCAAAGACGCCTTAGCCTAA 
ACACA-3’. To quantify the abundance of targeted alleles we used a qPCR with 
primers 8: 5’-GATACGTGAACTGGATCAAAGAAA-3’ and 9: 5’-CAAATGG 
TTATCAGTCTTGATCG-3’ and TaqMan probe: 5’-CACATCACAACCACAA 
CCTTCTCAGGT-3’. For non-injected controls, no qPCR signal was detected with 
primers 8-9. The abundance of the template for each amplicon was calculated 
using its own standard curve. We calculated the ratio between the abundance of 
the template for the 6-7 primer pair to the abundance of the template for the 8-9 
primer pair. We then conservatively corrected this ratio by multiplying it by a fac- 
tor of 0.7 to account for hepatocyte frequency in the samples”’, as only hepatocytes 
are targeted by rAAV8 in the liver’. 

Assessing rate of F9-containing Alb mRNAs by qPCR. cDNA produced from 
reverse transcription with a poly-dT primer served as a template for two different 
TaqMan qPCR assays. We quantified the abundance of wild-type Alb mRNA by 
qPCR with primers 10: 5-CTGACAAGGACACCTGCTTC-3’ and 11:5’-TGAGT 
CCTGAGTCTTCATGTCTT-3’, and TaqMan probe: 5’-CCACAACCTTCTCA 
GGCTACCCTGA-3’. We quantified the abundance of fused mRNAs by a TaqMan 
qPCR with primers 12: 5'-CCAAGGTGTCCAGATACGTG-3’ and 11: 5'-TGA 
GTCCTGAGTCTTCATGTCTT-3’, and TaqMan probe: 5’-CCACAACCTTCT 
CAGGCTACCCTGA-3’. For non-injected controls, no qPCR signal was detected 
with primers 11-12. For the no-reverse-transcriptase control, no signal was detected 
with primers 10-11 and 11-12. The rate of F9-containing Alb mRNAs was calcu- 
lated as the ratio between the abundance of template for the 10-11 primer pair to 
the abundance of template for the 11-12 primer pair. 

Assessing specificity of F9 expression. cDNA produced from reverse transcrip- 
tion with a poly-dT primer served as a template for two different TaqMan qPCR 
assays. We quantified the abundance of on-target F9 expression using a qPCR with 
primers 13: 5'-CTTGGGCTTGTGCTTCAC-3’ and 14: 5'-AGGATCTTGTTGG 
CGTTCTC-3’, and TaqMan probe: 5’-CGGTACACTCGGCGCTCAGC-3’. We 
quantified total F9-containing mRNA abundance using a qPCR with primers 15: 
5'-GCTCTTGCTGAGCTGGTGA-3’ and 14: 5'-AGGATCTTGTTGGCGTTCT 
C-3', and TaqMan probe: 5’-CGACTGAGGGTCCAAACCTTGTCA-3’. Calcula- 
tions of abundance used a separate standard curve for each pair. No qPCR signal 
was detected for no-reverse transcriptase and non-injected controls. The rate of 
Alb-F9 mRNAs to total F9-containing mRNAs was determined by comparing the 
abundance of template for the 13-14 primer pair to abundance of template for the 
14-15 primer pair. 

Assessing vector copy number. DNA purified from mice liver served as a template 
for two different TaqMan qPCR assays. We quantified the abundance of haploid 
mouse genomes using a qPCR with primers ‘Alb copy number Ref PF’: 5’-CAGAA 
CGGTCTTTCTCGGAT-3’ and ‘Alb copy number Ref R’: 5‘-TCTTCATCCTGC 
CCTAAACC-3’, and TaqMan probe: 5'-CTCAGCCCTGCAGTGCTGCA-3’. We 
quantified the abundance of AAV genomes (episomal or integrated) using qPCR 
with primers 8: 5'-GATACGTGAACTGGATCAAAGAAA-3’ and 9: 5’-CAAA 
TGGTTATCAGTCTTGATCG-3’, and TaqMan probe: 5’-CACATCACAACCA 
CAACCTTCTCAGGT-3’. For non-injected controls, no qPCR signal was detected 
with primers 8-9. The abundance of the template for each amplicon was calculated 
using its own standard curve. Vector copy number per haploid genome was calcu- 
lated as the ratio between the abundance of the template for the 8-9 primer pair to 
the abundance of the template for the Alb copy number Ref F and Alb copy number 
Ref R primer pair. 

Statistics. Statistical analyses were conducted with Microsoft Excel. Experimental 
differences were evaluated by a Student’s one-tailed t-test assuming equal variance 
(except for the ALT measurement, where equal variance was not assumed). 
Liver immunohistochemistry. Liver lobes were collected from mice, rinsed in ice 
cold dPBS, blotted dry, mounted in OCT (Tissue-Tek 4583) and frozen in cryomolds 
(Tissue-Tek 4557) in a liquid nitrogen cooled methylbutane bath. Cryomolds were 
placed at — 80 °C until sectioning. In all samples, a minimum of two liver sections 
from varying depths were cut at 5 um and mounted onto positively charged glass 
slides (Thermo 6776214). Fluorescent staining was performed according to estab- 
lished protocols* with minor modifications. In brief, frozen sections were thawed 
to room temperature, fixed in acetone for 10 min, removed and allowed to air dry. 
Slides were then washed three times in dPBS for 2 min, blocked with 5% donkey 
serum (Santa Cruz sc-2044) for 15 min at room temperature in a humidified cham- 
ber, washed once in dPBS for 1 min and then sections were circled with an Imm- 
Edge pen (Vector Laboratories H-4000). Slides were incubated with 200 l of goat 
anti-human F9 IgG primary antibody (Affinity Biologicals, GAFIX-AP) at 1:100 in 
5% donkey serum for 3 h and then washed three times in dPBS for 2 min. Slides were 
then incubated with 200 pl of donkey anti-goat AlexaFluor 594 IgG secondary 
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antibody (Life Technologies A11058) at 1:400 in dPBS for 30 min and then washed 
three times in dPBS for 2 min. Slides were rinsed with distilled water and mounted 
with 80 ,] ProLong Gold Antifade with DAPI nuclear counterstain (Life Techno- 
logies P36935) and covered with a #1.5 coverslip (Thermo 12-544-G). Fluorescent 
images were taken on a Zeiss Observer.Z1 microscope equipped with a Zeiss Axiocam 
MRc colour camera and Zeiss AxioVision software (version 4.8.2.0). Images were 
overlayed using Adobe Photoshop CS6 software (version 13 x64). Controls in- 
cluded no-primary secondary-only antibody staining, and comparison to positive 
control frozen human liver tissue sections (Zyagen HF-314) and negative control 
frozen untreated mouse liver sections. 

ALT measurements. Serum ALT measurements were performed on mouse serum 
obtained via retro-orbital bleeding using an ALT kinetic measurement kit (Teco 
Diagnostics) compared with a standard curve. AAV8-U6 and H1 promoter short 
hairpin RNA (shRNA) sequences are derived from on shRNA toxicity studies per- 
formed previously”. 

Assessing the distribution of Alb haplotypes in the human population. A selec- 
tion of the Shapelt2 phased haplotypes for the 1000 Genomes Phase 1 integrated 
variant calls, corresponding to a region 1.3-kb upstream and 1.4-kb downstream 
from the human F9 integration site at the Alb stop codon, was downloaded from 


ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/phase1/analysis_results/shapeit2_phased_ 
haplotypes/ using the 1000 Genomes Data Slicer tool (available at http://browser. 
1000genomes.org/Homo_sapiens/UserData/SelectSlice). Haplotypes consisting of 
single nucleotide polymorphisms with a substantial frequency of the alternative 
allele in all populations were combined (here, greater than or equal to 45%, whereas 
those excluded were less than 1%) and treated as individual strings for calculating 
population frequency. 
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Extended Data Figure 1 | Human F9 liver immunohistochemistry. From liver, negative control untreated mouse liver, and two sets of representative 
top to bottom, panels show human F9 staining (red) with 4’,6-diamidino-2- stains from mice treated as neonates or adults with AAV8-F9. Original 
phenylindole (DAPI) nuclear counterstain (blue) in positive control human magnification, X200. 
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Step 1: Linear amplification (LAM) 
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Extended Data Figure 2 | Scheme of targeting rate assessment. Assessment 
of on-target integration rate begins using linear amplification with biotinylated 
primer 1 (black), annealing to the genomic locus but not to the vector (step 1). 


Linear amplicons are then bound to streptavidinylated beads and washed 


to exclude episomal vectors (step 2). Subsequent second-strand DNA synthesis 


with random primers (step 3) was followed by CviQI restriction digestion 


Step 2: Binding to beads and wash 


CviQ!I Stop 


CviQlI¥Stop 


Step 4: CviQ/ cleavage 
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Step 8: qPCR 
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(step 4). A compatible linker is then ligated (step 5) followed by two rounds of 
nested PCR amplifications (primers 2-3 in blue (step 6), and then primers 
4-5 in red (step 7)). CviQI cleaves at the same distance from the homology 
border in both targeted and wild-type alleles, thus allowing for unbiased 
amplification. The amplicons of the second nested PCR then serve as a template 
for qPCR assays with either primers 6-7 (green) or 8-9 (orange) (step 8). 
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Standard curve for targeted allele 
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Extended Data Figure 3 | Standard curves for targeting rate assessment by 
qPCR. qPCR standard curves for the targeted allele (primers 8 and 9, 
Fig. 3) and non-targeted allele (primers 6 and 7, Fig. 3). Mass units used 


Standard curve for non- targeted allele 


y = 3.3406x + 36.579 
R? = 0.9997 


6 8 10 12 


are functionally equivalent to molarity because all amplicons used were of 
equal length. 
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Extended Data Figure 4 | Toxicity assessment by ALT measurement. cassette (5 X 10'' of U6 promoter-driven shRNA). Data represent mean of two 
Alanine transaminase levels (ALT) were evaluated 7 days after injection in measurements of four independent mice for each groups. The statistical 
mice injected with AAV8 coding for our experimental vector (1 X 10”) ora _ significance is defined here as having P < 0.05 in a one-tailed t-test between 
negative control coding for a known non-toxic cassette (1 X 10'* of H1 samples of different variance. 


promoter-driven shRNA), or a positive control coding for a known toxic 
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Extended Data Figure 5 | Vector copy number. Vector copy number assessed by qPCR using primers 8 and 9 (Fig. 3). 1 = 7 for mice injected as adults; n = 6 for 


mice injected as neonates and analysed before or after partial hepatectomy (PH). Error bars represent s.d. 
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Extended Data Table 1 | Haplotypes in the human population at the relevant ALB locus as extracted from the 1000 Genomes Project 


Position | 1D | REF | ALT | WAPI | HAp2 | HAPS | HAPs | HAPS | HAP6 | HAP7_| 
74295239 | ro6n004 | oc | oT | oT | ce OT cc Tc Tc te 
ranasss2 | w4o76 | A | Go | a | Go | a tT a} aA |G 
74295567 | 962005 | oc | A Tc OT A A A 
74295758 | wa36766| Go| TT OG TT TO GG TT 
74295823 | wanse7e7 |G | A OT OG | AT GT GG A A 
74287403 
Frequency| | Este | 4.6% | 2.15% | 15% | 137% | 0.09% | 0.05% _| 
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Productivity limits and potentials of the principles of 


conservation agriculture 


Cameron M. Pittelkow!*}, Xingiang Liang”*, Bruce A. Linquist', Kees Jan van Groenigen’, Juhwan Lee’, Mark E. Lundy’, 
Natasja van Gestel*, Johan Six*, Rodney T. Venterea®® & Chris van Kessel! 


One of the primary challenges of our time is to feed a growing and 
more demanding world population with reduced external inputs and 
minimal environmental impacts, all under more variable and extreme 
climate conditions in the future’ *. Conservation agriculture repre- 
sents a set of three crop management principles that has received 
strong international support to help address this challenge”®, with 
recent conservation agriculture efforts focusing on smallholder farm- 
ing systems in sub-Saharan Africa and South Asia’. However, conser- 
vation agriculture is highly debated, with respect to both its effects on 
crop yields*” and its applicability in different farming contexts”. 
Here we conduct a global meta-analysis using 5,463 paired yield obser- 
vations from 610 studies to compare no-till, the original and central 
concept of conservation agriculture, with conventional tillage prac- 
tices across 48 crops and 63 countries. Overall, our results show that 
no-till reduces yields, yet this response is variable and under certain 
conditions no-till can produce equivalent or greater yields than con- 
ventional tillage. Importantly, when no-till is combined with the other 
two conservation agriculture principles of residue retention and crop 
rotation, its negative impacts are minimized. Moreover, no-till in 
combination with the other two principles significantly increases 
rainfed crop productivity in dry climates, suggesting that it may become 
an important climate-change adaptation strategy for ever-drier regions 
of the world. However, any expansion of conservation agriculture 
should be done with caution in these areas, as implementation of the 
other two principles is often challenging in resource-poor and vul- 
nerable smallholder farming systems, thereby increasing the likeli- 
hood of yield losses rather than gains. Although farming systems are 
multifunctional, and environmental and socio-economic factors need 
to be considered’*’, our analysis indicates that the potential contri- 
bution of no-till to the sustainable intensification of agriculture is 
more limited than often assumed. 

To help address global food security challenges, conservation agricul- 
ture holds much promise as ‘an approach to managing agro-ecosystems 
for improved and sustained productivity, increased profits and food 
security while preserving and enhancing the resource base and the 
environment’’’. Conservation agriculture represents a set of three crop 
management principles: (1) direct planting of crops with minimum soil 
disturbance (that is, no-till), (2) permanent soil cover by crop residues 
or cover crops, and (3) crop rotation”®. In recent decades, widespread 
adoption of no-till has occurred over approximately 125 million hect- 
ares, equivalent to 9% of global arable land, with varying degrees of appli- 
cation of the other two conservation agriculture principles*’*. However, 
the impacts of no-till by itself and conservation agriculture on crop pro- 
ductivity remain contested*"’’. Here, we synthesized current scientific 
evidence at a global scale to assess crop yields under no-till in relation 
to implementation of the other two conservation agriculture princi- 
ples, residue retention and crop rotation. 


A comprehensive meta-analysis was performed on data from peer- 
reviewed publications, representing the largest assessment so far on this 
topic. Because not all three principles of conservation agriculture are 
adopted by all farmers*””, studies at a minimum had to include no-till, 
the original and central concept of conservation agriculture, and con- 
ventional tillage treatments (note: minimum-tillage practices were not 
included). Only field experiments containing side-by-side yield com- 
parisons were included in the database (see Methods for study selection 
details). Since conservation agriculture is not necessarily a low-input 
form of agriculture and in fact has been adopted to the greatest extent” 
in countries characterized by highly mechanized, high-input agricultural 
systems, all comparisons were included regardless of input intensity. To 
examine how the effects of no-till changed across the other two princi- 
ples of conservation agriculture, yield comparisons were grouped into 
categories based on the presence or absence of residue management and 
crop rotation practices as determined by information reported in the 
original studies. For each paired yield comparison, no-till and conven- 
tional tillage treatments received the same residue management and 
rotation practices. In total, the database consisted of 5,463 observations 
from 610 studies. 

Overall, we found that no-till negatively impacts crop yields by 5.7% 
(Fig. 1), although under certain conditions it produces yields equivalent 
to or even greater than conventional tillage systems (Figs 2 and 3). To 
limit global agricultural expansion and thereby reduce net environmen- 
tal degradation, enhancing production per unit area through agricultural 
intensification efforts has been identified as a promising approach**!*"*, 
However, our meta-analysis indicates that no-till is limiting rather than 
enhancing global crop production and sustainable intensification efforts. 
Certainly, yield is only one component of agricultural systems, and there 
is an urgent need to optimize farming practices across other environ- 
mental and socio-economic performance indicators’’*. We recognize 
that in many, but definitely not all situations, continuous no-till along 
with the other two conservation agriculture principles may represent a 
more profitable management system (often because of reduced energy/ 
diesel costs related to tillage), with the potential to improve soil quality 
and provide greater ecosystem services'*'”””. In addition, as agricultural 
crop yields are variable in time and space, yield outcomes can be dif- 
ficult to predict at the individual farm-scale. 

Importantly, the negative impacts of no-till are minimized when both 
of the other conservation agriculture principles are also applied (—2.5%) 
(Fig. 1). The largest yield declines occur when no-till is implemented alone 
(—9.9%) or with only one other conservation agriculture principle (—5.2 
and — 6.2% for residue retention and crop rotation, respectively). To help 
close the yield gap with conventional tillage, these findings suggest that 
instead of implementing no-till as the first step towards conservation 
agriculture in cropping systems where residue retention and crop rota- 
tion are absent (and anticipating that these two principles will follow in 
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Figure 1 | Comparison of yield in no-till versus conventional tillage systems 
in relation to the other two principles of conservation agriculture. 

Results are shown for the entire data set (overall) and for subcategories of 
studies which indicated the presence or absence of residue retention and crop 
rotation for both no-till and conventional tillage systems: +RR+CR (residue 
retention + crop rotation), +RR (residue retention), +CR (crop rotation), 
or -RR-CR (without residue retention or crop rotation). The number of 
observations and total number of studies included in each category are 
displayed in parentheses. Error bars represent 95% confidence intervals. 
Significant differences between categories are indicated by P values based on 
randomization tests. 


time), the primary focus should be on implementing no-till systems that 
already employ the other two principles. This conclusion has impor- 
tant implications for the promotion of conservation agriculture as an 
agricultural development strategy in areas where the former is common, 
including areas of sub-Saharan Africa or South Asia’’”"’. Because resi- 
due retention and crop rotation are generally considered good agro- 
nomic practices, a reconsideration of the order in which conservation 
agriculture principles are introduced in these regions (that is, better 
targeting of no-till to systems already based on the other two conserva- 
tion agriculture principles) is not in conflict with general recommenda- 
tions for sustainable crop production. 

Our analysis, synthesizing information from hundreds of field trials 
across 48 crops and 63 countries (Extended Data Fig. 1), shows that no- 
till significantly enhances yields (7.3%) under rainfed agriculture in dry 
climates when the other two conservation agriculture principles are also 
implemented (Fig. 2a). Yet, the reverse is true when no-till is applied 
alone (—11.9%). Furthermore, yields decrease with no-till regardless 
of whether the other principles are applied in humid climates (Fig. 2b). 
These results are consistent with smaller data sets (for example, 26 studies 
on no-till rainfed maize) in which residue retention in semiarid environ- 
ments and crop rotation positively impacted yields’. A yield benefit with 
no-till in combination with the other two conservation agriculture prin- 
ciples in dry climates is probably because of improved water infiltration 
and greater soil moisture conservation®”®. We found that when water is 
non-limiting owing to irrigation, no-till in dry climates maintains yields 
similar to conventional systems (residue retention + crop rotation mean 
effect size for 34 studies and 213 observations: — 3.0%; 95% confidence 
interval: —6.2 to 0.4%), providing further support for this conclusion. 

To help meet current and future crop production challenges, our results 
suggest that this set of integrated management practices can provide 
agronomic benefits in water-limited and/or water-stressed regions. This 
is an important finding given that millions of hectares in dry climates 
of sub-Saharan Africa and South Asia have recently been identified as 
suitable for sustainable intensification efforts”!. Still, if conservation 
agriculture is to be successful at increasing crop productivity in these 
areas, it must be adjusted to local conditions through an innovative, 
multi-stakeholder driven approach that is sensitive to market oppor- 
tunities, equipment availability, and farmers’ production objectives and 
needs*'”'°. Our findings further suggest that no-till in combination with 
the other two conservation agriculture principles, when targeted appro- 
priately, may become an increasingly important strategy to deal with 
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Figure 2 | Comparison of rainfed crop yield in no-till versus conventional 
tillage systems in relation to the other two principles of conservation 
agriculture as a function of climate. The influence of (a) ‘Dry’ and (b) 
‘Humid’ climates, defined by aridity index values (mean annual precipitation 
divided by potential evapotranspiration) less or more than 0.65, respectively. 
Categories represent studies that indicated the presence or absence of residue 
retention and crop rotation for both no-till and conventional tillage systems: 
+RR+CR (residue retention + crop rotation), +RR/+CR (either residue 
retention or crop rotation), or -RR-CR (without residue retention or crop 
rotation). The number of observations and total number of studies included in 
each category are displayed in parentheses. Error bars represent 95% 
confidence intervals. Significant differences between categories are indicated by 
P values based on randomization tests. 


soil moisture stress due to climate change. Projected changes in pre- 
cipitation and temperature are expected to cause increased drying and 
drought in important agricultural production areas of the world”. 
Depending on the severity of these changes, a number of adaptations 
will be required to maintain global agricultural production levels; the 
attributes of soil moisture conservation and water use efficiency with the 
three principles of conservation agriculture could play an important role. 

It is often suggested that the risk for short-term decreases in crop 
productivity represents a major barrier for farmers considering conser- 
vation agriculture". Our results confirm this; regardless of whether 
the other two conservation agriculture principles are implemented, no- 
till reduces yields in the first few years following adoption (Fig. 3a). How- 
ever, the yield decline in initial years is minimized when all three principles 
are applied compared with one principle (—3.0% versus — 11.4%, respec- 
tively). Moreover, despite no-till yields in all categories becoming com- 
parable with conventional tillage in the medium term (Fig. 3b), after 10+ 
years yields begin to decline when only no-till is implemented (Fig. 3c). 
Hence, to mitigate the negative impacts of no-till, our findings empha- 
size the importance of implementing all three principles and the overall 
need for strategies to overcome yield reductions in early and later (10+) 
years. Although the economic benefits of conservation agriculture may 
be more strongly driven by cost reductions rather than increased yields”, 
negative yield outcomes can discourage poorer farmers who tend to focus 
on short-term gains, probably making it an overriding factor limiting 
the adoption of conservation agriculture”. 

It cannot be determined from our database whether initial yield reduc- 
tions are caused by biophysical conditions (for example, soil structure, 
decomposition of residues on the soil surface) or sub-optimal manage- 
ment (that is, a learning curve effect). The transition to no-till integrated 
with the other two conservation agriculture principles is challenging 
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Figure 3 | Comparison of yield in no-till versus conventional tillage systems 
in relation to the other two principles of conservation agriculture over time. 
The influence of (a) 1-2, (b) 3-9, and (c) 10+ years following no-till 
implementation. Categories represent studies that indicated the presence or 
absence of residue retention and crop rotation for both no-till and conventional 
tillage systems: +RR+CR (residue retention + crop rotation), +RR/+CR 
(either residue retention or crop rotation), or -RR-CR (without residue 
retention or crop rotation). The number of observations and total number of 
studies included in each category are displayed in parentheses. Error bars 
represent 95% confidence intervals. Significant differences between categories 
are indicated by P values based on randomization tests. 


as it represents a holistic change in management requiring adaptation 
at the individual farm-level. A targeted review of no-till studies in sub- 
Saharan Africa and South Asia reported a high risk of short-term yield 
declines for major annual crops”®. Similar to previous work’, these authors 
also noted that implementation of the other two conservation agricul- 
ture principles can minimize this risk and that no-till yield losses tend 
to diminish with time’’. Interestingly, regardless of initial impacts on 
yield, our results do not indicate that no-till outperforms conventional 
tillage in the 10+ year category. One possible explanation for these results 
is that weed, pest, and disease pressures may increase with continuous 
no-till systems over time depending on how the other conservation agri- 
culture principles are implemented”, possibly offsetting improvements 
in soil quality. Further research is needed to identify initial and long- 
term yield constraints of no-till systems. 

When considering the relative importance of crop rotation versus 
residue management practices in enhancing yield of no-till systems, our 
meta-analysis does not provide evidence that one principle regulates 
productivity more than the other. Across all observations, the indivi- 
dual effects of residue retention and crop rotation reduce the negative 
impacts of no-till by 4.8 and 3.8%, respectively, although differences 
between categories are insignificant (Fig. 1). However, in dry climates 
these principles each have a much stronger effect on rainfed crop yields, 
reducing yield losses by 10.1 and 11.0%, respectively. Indeed, previous 


LETTER 


work has stressed the importance of residue retention to enhance soil 
and cropping system benefits of reduced tillage systems*”®, with our 
study being the first to quantify impacts on crop productivity at a global 
scale. Our results illustrate the need to implement at least one, and pref- 
erably both, principles in addition to no-till in rainfed cropping systems 
in dry climates, while also suggesting that consistent yield declines with 
no-till in humid environments may be primarily caused by factors unre- 
lated to these principles. 

Clearly, there are important environmental (for example, reduced 
erosion and improved soil quality) and economic outcomes of contin- 
uous no-till>'*!” beyond the scope of the present analysis that might 
justify adoption at the farm scale and should be considered in a trade- 
off analysis against yield reductions. Nevertheless, agricultural regions 
containing a disproportionate number of the world’s poor, including 
sub-Saharan Africa and South Asia, currently struggle with food secu- 
rity issues and have a high probability of experiencing yield reductions 
due to climate change in the future”. Despite the promising effects of 
no-till in certain contexts (that is, rainfed agroecosystems in dry cli- 
mates), we stress that benefits in yield are only seen when the other two 
conservation agriculture principles are also implemented. Of far greater 
concern is that no-till alone tends to have the opposite of the intended 
goal, thereby placing farmers at increased risk of yield losses. It is pre- 
cisely resource-poor and vulnerable smallholder farming systems that 
will have the greatest challenges adopting the other two principles, most 
notably the retention of crop residues due to strong competition for 
residues by livestock and other uses*’’. Hence, efforts to expand con- 
servation agriculture further must remain conscious of the potential for 
no-till to ‘backfire’ in these contexts. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Data collection. We comprehensively searched the peer-reviewed literature for 
publications investigating the effects of no-till in relation to the other two conser- 
vation agriculture principles on crop yields from Jan 1980 to May 2013 using Scopus 
(Elsevier). Search terms included ‘tillage’, ‘no till’, ‘zero till’, ‘direct drill*’, or ‘con- 
servation ag*’ in the article title and ‘yield’ in the article title, abstract, or keywords. 
Conference proceedings and non-English language publications were excluded. 
This search produced a total of 2,471 publications, which were screened on the basis 
of the following criteria: (1) studies had to represent field experiments containing 
side-by-side comparisons of no-till and conventional tillage practices; (2) no-till 
treatments consisted of zero tillage immediately before crop establishment for a 
given growing season (that is, reduced tillage treatments such as strip-tillage were 
rejected); (3) crop yields were reported; (4) location of the experiment was stated; 
(5) management information regarding at least one other conservation agricul- 
ture principle was available (that is, residue management or crop rotation); and 
(6) confounding effects between treatments were absent (that is, differences in residue 
management, seeding rates, fertilizer rates, etc. were determined to be negligible). 
When more than one form of tillage was assessed in a study, we selected the treat- 
ment representing the greatest soil disturbance (generally mouldboard plow). 
Although it only represented a small portion of the data, no-till treatments were 
not always required to represent continuous zero-tillage (for example, if two crops 
were grown per year and the first crop required tillage but the second was planted 
using no-till practices, yield comparisons for the second crop were included). 

Studies were rejected if it was unclear from reading the experimental methods 
whether factors other than tillage differed between treatments with the exception 
of herbicides (the absence of tillage as a weed control strategy generally requires 
changes in herbicide management under no-till”). Owing to the large size of the 
database, particular attention was given to avoiding data duplication (for example, 
when different studies reported the same data). Thus, if it was unclear whether a 
publication contained duplicate data, it was rejected. A number of publications from 
the conservation agriculture literature were rejected because of the lack of a control 
treatment that satisfied our criteria (that is, conservation agriculture treatments 
representing all three principles were compared with conventional tillage treat- 
ments with residues removed and no rotation). 

Means for no-till and conventional tillage yields were extracted from each study 
in addition to study and site characteristics including crop type, study location, 
study duration, irrigation, residue management, and crop rotation practices. In 
cases where yield data were only presented in figures, values were extracted using 
Plot Digitizer (http://plotdigitizer.sourceforge.net/). In a few instances where yield 
data were only reported as a percentage change relative to the other treatment, we 
assumed absolute yield values for the reference treatment and calculated the nat- 
ural log of the response ratio normally as described below (because this metric only 
quantifies the relative difference between means, the same value is produced regard- 
less of the absolute magnitude of means). To investigate changes in yield over time, 
the number of years since the initiation of no-till was recorded for each observation. 
Observations were excluded from the analysis of no-till duration when only mean 
yields over a number of years were presented. For the small number of studies in 
which tillage occurred periodically in no-till systems, the duration of no-till was 
reset to time zero with each tillage event. For example, if tillage occurred during the 
rice phase of a rice-wheat rotation during five consecutive years, each wheat yield 
observation was recorded as the first year under no-till. 

The effect of climate was assessed for yields under rainfed conditions by deter- 
mining the aridity index (mean annual precipitation divided by potential evapo- 
transpiration) for each study using latitude and longitude coordinates and the 
WorldClim database”’. Following the generalized climate classification scheme”, 
aridity index values less and more than 0.65 were categorized as ‘dry’ and ‘humid’, 
respectively. If latitude and longitude coordinates were not stated, an attempt was 
made to contact study authors. Otherwise, coordinates were estimated using the 
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location of the nearest city or the experiment station at which the study took place. 
Ina few cases where only large geographical areas were stated in publications, coor- 
dinates and aridity index values were not estimated. 

Crop rotation, residue management, and irrigation practices were recorded for 
each study as categorical variables where possible. Crop rotation was treated as a 
binary variable (yes/no), where ‘yes’ indicated that two or more crops were grown 
in sequence in the same field over time (including the use of a cover crop), and ‘no’ 
indicated continuous cultivation of a single crop. Residue management was also 
treated as a binary variable (retained/removed), where ‘retained’ indicated that 
crop residues were retained in the field following harvest each growing season (or 
in a minority of cases, residues were supplied from elsewhere or by growing a cover 
crop between seasons), and ‘removed’ indicated that residues were physically removed 
from the field or burned following harvest. Information for categorical variables 
was extracted from the Materials and Methods section of publications, and to a 
lesser extent was inferred from discussions of crop management details found in 
the Introduction or Discussion sections. Irrigation practices (yes/no) were recorded 
when available, with cells left blank when irrigation practices were unclear. 

Before data analysis, observations were grouped into categories depending on 

the presence of residue management and crop rotation practices as determined by 
information reported in the original studies. For each paired yield observation, we 
required that no-till and conventional tillage treatments received the same residue 
management and rotation practices. Thus, categories represented the following 
comparisons: three principles (no-till versus conventional tillage, both with resi- 
dues retained + crop rotation), two principles (no-till versus conventional tillage, 
both with either residues retained or crop rotation), and one principle (no-till versus 
conventional tillage, both without residue retention or crop rotation). 
Data analysis. Following previous work”, we calculated the natural log of the 
response ratio (the ratio of no-till to conventional tillage yields) as the effect size in 
our meta-analysis. Because within-study variance measures for mean yields were 
available for less than a few percent of studies, individual observations were weighted 
by replication, with weights = (ncony. X Mno-titl)/(Meonv. + Mno-till), Where Neony, and 
Myo-til) are the number of replicates for conventional tillage and no-till treatments, 
respectively*’. In situations where more than one observation from a study was 
included in a category, weights were divided by the total number of observations 
from that study. When yield values for a treatment equalled zero and thereby indi- 
cated crop failure or experimental error, observations were excluded. Moreover, 
observations more than five standard deviations from the weighted mean effect 
size within each category were excluded (this represented <0.5% of data on aver- 
age). All statistical analyses were conducted with R (version 3.0.2). Bootstrapping 
procedures within the ‘boot’ package”? were used to generate 95% confidence inter- 
vals for weighted mean effect sizes using 4,999 iterations”’. Between-group hetero- 
geneity was assessed using randomization procedures based on 4,999 replications”. 
Results were considered significant if confidence intervals did not overlap with zero 
and randomization tests yielded P values <0.05. For ease of interpretation, all 
results were back-transformed and reported as percentage change in yield for no- 
till relative to conventional tillage practices. 
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Extended Data Figure 1 | The location of studies containing yield comparisons between no-till and conventional tillage systems used in the meta-analysis. 


©2015 Macmillan Publishers Limited. All rights reserved 


Mates I Tea 


doi:10.1038/nature13827 


Long-term phenotypic evolution of bacteria 


German Plata’, Christopher S. Henry’ & Dennis Vitkup* 


For many decades comparative analyses of protein sequences and 
structures have been used to investigate fundamental principles of 
molecular evolution’”. In contrast, relatively little is known about 
the long-term evolution of species’ phenotypic and genetic properties. 
This represents an important gap in our understanding of evolu- 
tion, as exactly these proprieties play key roles in natural selection 
and adaptation to diverse environments. Here we perform a com- 
parative analysis of bacterial growth and gene deletion phenotypes 
using hundreds of genome-scale metabolic models. Overall, bacte- 
rial phenotypic evolution can be described by a two-stage process 
with a rapid initial phenotypic diversification followed by a slow 
long-term exponential divergence. The observed average divergence 
trend, with approximately similar fractions of phenotypic properties 
changing per unit time, continues for billions of years. We experimen- 
tally confirm the predicted divergence trend using the phenotypic 
profiles of 40 diverse bacterial species across more than 60 growth 
conditions. Our analysis suggests that, at long evolutionary distances, 
gene essentiality is significantly more conserved than the ability to 
utilize different nutrients, while synthetic lethality is significantly less 
conserved. We also find that although a rapid phenotypic evolution 


is sometimes observed within the same species, a transition from high 
to low phenotypic similarity occurs primarily at the genus level. 
Analyses of phenotypic evolution, such as the morphological varia- 
tion of beaks in Darwin’s finches’, provided the original impetus and 
context for understanding natural selection. Because the evolutionary 
importance and physiological role of specific phenotypic traits change 
over time, it is often difficult to connect genotype to phenotype to fit- 
ness across long evolutionary distances, especially for metazoan organ- 
isms. For microbial species, on the other hand, the ability to metabolize 
different nutrient sources, although clearly not the only important phe- 
notype, always remains an essential determinant of their fitness and 
lifestyle. Even though a large-scale comparative analysis of microbial 
phenotypes—such as growth on different nutrients or the impact of 
genetic perturbations—is currently challenging owing to a relative pau- 
city of experimental data, we rationalized that thoroughly validated com- 
putational methods can be used to investigate the phenotypic evolution 
of diverse bacterial species. Flux balance analysis (FBA)’*, in particular, 
has been previously used to accurately predict gene and nutrient essen- 
tiality, growth yields, and evolutionary adaptations to environmental 
and genetic perturbations’. Notably, the accuracy of FBA methods has 
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Figure 1 | Diversity of considered bacterial families. The cladogram shows 
the evolutionary relationship between the 100 bacterial families that include the 
322 species considered in our study. The tree is based on the average 16S 
ribosomal RNA (rRNA) genetic distances between species in each family 


(see Methods). The numbers of considered species in each family are shown in 
parentheses. Different colours represent different bacterial classes. The tree 
was rooted using the Methanosarcina barkeri rRNA sequence. 


Department of Systems Biology, Center for Computational Biology and Bioinformatics, Columbia University, New York, New York 10032, USA. Integrated Program in Cellular, Molecular, Structural and 
Genetic Studies, Columbia University, New York, New York 10032, USA. 3Mathemathics and Computer Science Division, Argonne National Laboratory, Argonne, Illinois 60439, USA. “Department of 


Biomedical Informatics, Columbia University, New York, New York 10032, USA. 


15 JANUARY 2015 | VOL 517 | NATURE | 369 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


been independently demonstrated for many dozens of species encom- 
passing diverse phylogenetic distributions and growth environments®. 
We selected for our analysis more than 300 phylogenetically diverse 
bacteria (Fig. 1) for which genome-scale metabolic models were recon- 
structed using a recently developed protocol’ (see Methods). 

To investigate the long-term evolution of growth phenotypes, we con- 
sidered 62 carbon sources that are commonly used by microbial species 
for growth and energy production’. For each considered species we used 
FBA to determine a subset of the compounds that could be used for 
biomass synthesis or ATP generation—two key metabolic objectives of 
bacterial growth’. This analysis resulted in binary phenotypic profiles 
that describe the ability of each microbial species to use each of the con- 
sidered compounds (see Methods). The evolution of these phenotypic 
profiles—that is, the change in phenotypic similarity as a function of spe- 
cies divergence (genetic distance)—is shown in Fig. 2a, (see also Extended 
Data Fig. 1). Notably, this analysis demonstrated that the average long- 
term evolution of growth phenotypes can be approximated well by an 
exponential decay (see Methods and Extended Data Table 1). A three- 
parameter exponential model fits the data in Fig. 2a significantly better 
than simpler alternative models (Extended Data Table 2). Similar diver- 
gence trends were observed for larger sets of carbon source compounds 
(Extended Data Fig. 2), and for compounds that could be used as a 
source of nitrogen (Fig. 2b). The observed trend was also robust towards 
subsampling or removal of specific species and families used in the anal- 
ysis (Extended Data Fig. 3). 

The observed exponential trends suggest that as microbial species 
diverge over planetary timescales and adapt to different environmental 
niches, approximately similar fractions of phenotypic properties change 
per unit time. For species separated by more than 1 billion years of 
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Figure 2 | Evolution of bacterial metabolic growth phenotypes. Genetic 
distances in the figure are based on bacterial 16S rRNA sequences. a, The 
evolution of phenotypic similarity in the usage of carbon sources for biomass 
synthesis. The colours represent the point density at a given genetic distance for 
all pairwise comparisons between metabolic models (n = 26,106). The black 
line shows a three-parameter exponential fit to the computational predictions; 
the red line shows a moving average of the predictions. b, Like a, but phenotypic 
similarity in the usage of nitrogen sources for biomass production across 
metabolic models (n = 36,856). c, Experimental analysis of the long-term 
phenotypic divergence trend. Grey points represent pairwise comparisons of 
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evolution (~0.2 genetic distance in Fig. 2), the divergence of growth 
phenotypes approaches saturation around a similarity of 21% (Fig. 2a), 
which is higher than the value expected by chance (~12%) given the 
average number of carbon compounds used by the models. This dif- 
ference is likely due to a widespread utilization of common nutrient 
sources across bacterial species (see Extended Data Table 3)". 

Notably, before the evolution of growth phenotypes settles into the 
aforementioned average trend, a much higher rate of phenotypic evo- 
lution is observed for pairs of bacteria at very close genetic distances 
(<0.01, or ~50 million years’). Our computational analysis predicts 
~71% phenotypic similarity for closely related bacteria (Fig. 2, Extended 
Data Table 1), which agrees well with available experimental data on intra- 
species phenotypic similarity: for example, 75% for the utilization of 
carbon sources in Escherichia coli’* and 69% for Campylobacter jejuni’. 
The diversity of bacteria observed at close distances reflects a well- 
documented genetic and phenotypic variability within bacterial pan- 
genomes". The observed patterns also suggest that phenotypic evolution 
proceeds in two different stages, namely through fast phenotypic diversifi- 
cation of closely related strains followed by a slower exponential divergence 
lasting billions of years. Notably, patterns of multi-stage and hierarchical 
evolution have been observed in other systems, for example in bacterial 
and eukaryotic developmental networks'*"*. 

To validate experimentally the predicted patterns of long-term phe- 
notypic evolution of bacteria, we obtained GENIII Biolog Phenotype 
Microarrays® data for 40 diverse microbial species (Extended Data Fig. 4 
and Supplementary Data 1). Phenotype Microarrays data were used to 
determine the ability of each considered bacteria to utilize the 62 different 
carbon sources used in the simulations (Fig. 2c; see Methods). In agree- 
ment with previous results’, FBA predicted microbial growth phenotypes 
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carbon usage phenotypes (Biolog data) between 40 bacterial species (n = 780). 
The black squares represent the average values of experimental phenotypic 
similarity at different divergence distances. The black line represents an 
exponential fit to the experimental phenotypic similarity data; the red line 
represents an exponential fit to the computationally predicted phenotypic 
similarity data for biomass synthesis. d, The variability of experimental and 
computationally predicted phenotypic similarity at different divergence distances. 
The variability was quantified by the coefficient of variation, defined as the ratio 
of the standard deviation to the mean. Error bars in c and d represent s.e.m. 
obtained on the basis of 10,000 bootstrap re-samplings of the considered species. 
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Figure 3 | Distribution of phenotypic similarity at different levels of 
bacterial taxonomic classification. The distributions of phenotypic similarity 
in the usage of carbon sources for biomass synthesis were obtained based on 
computational simulations of metabolic models (n = 26,106). The dashed line 
connects the average values (black dots) of computational predictions at each 
taxonomic level. The red asterisks in the figure indicate the average values of 
experimental data obtained using Biolog arrays (see Methods). 


with an average accuracy of ~70%. Importantly, the experimental results 
agree well with the computationally predicted average trend describing 
phenotypic bacterial divergence (Fig. 2c). Also, for the experimental 
data, as well as for the computational predictions, the three-parameter 
exponential model fitted the data significantly better than simpler alter- 
native models (Extended Data Table 2). The comparison of computational 
and experimental values for the coefficient of variation of phenotypic 
similarity shows that computational predictions capture well not only 
the average trend but also the variability of phenotypic similarity for 
bacteria at different genetic distances (Fig. 2d). Overall, the analyses of 
experimental data suggest that although individual models need to be 
further validated and improved, high-throughput metabolic reconstruc- 
tions can be used for comparative functional studies across a large number 
of diverged species. 

We next investigated the diversity of metabolic growth phenotypes at 
different levels of conventional taxonomic classification (Fig. 3). Although 
bacteria from the same species show mostly similar phenotypic prop- 
erties, the long left tail of the top distribution in Fig. 3 suggests that some 
organisms have substantial phenotypic differences even at this basic 
taxonomic level. At the genus level the distribution is very broad, with 
an average similarity of ~60%; this suggests that transitions from high 
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Figure 4 | Evolution of bacterial genetic phenotypes. Genetic distances are 
based on bacterial 16S rRNA sequences. a, The evolution of similarity in gene 
essentiality across the considered bacterial species. The colours represent the 
point density at a given genetic distance for pairwise comparisons among 
considered models (n = 48,920). The red line shows a three-parameter 
exponential fit to the computational predictions. Black points represent the 
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to low phenotypic similarity usually occur at the level of genera. On the 
contrary, much lower conservation levels are observed for taxonomic 
ranks beyond the level of families, where the differences between the 
ranks are relatively small. This analysis suggests that computational 
approaches similar to the one presented here could be useful in refin- 
ing bacterial taxonomy. 

To complement the analysis of metabolic growth phenotypes, we 
used FBA to investigate the long-term evolution of gene deletion pheno- 
types (Fig. 4). Specifically, we considered the evolution of metabolic 
gene essentiality and synthetic lethality (see Methods). First, we con- 
firmed a high (~76%) accuracy of FBA gene essentiality predictions for 
considered species with available experimental data’ (Extended Data 
Table 4). Second, our analysis demonstrated that the average long-term 
evolution of gene essentiality can also be approximated by an exponen- 
tial divergence (Fig. 4a). Notably, the average rate of evolution of meta- 
bolic gene essentiality is substantially faster and saturates at closer genetic 
distances than the evolution of growth phenotypes (Fig. 2, see Extended 
Data Table 1). Even at long evolutionary distances, for an average pair 
of microbial species more than half of the conserved essential genes 
in one species usually remain essential in the other. Reassuringly, the 
predicted average trend (Fig. 4a, red line) is consistent with available 
experimental data (Extended Data Table 4) for microbial species with 
genome-wide gene deletion screens (Fig. 4a, black dots/black line). 

In contrast to gene essentiality, our analysis revealed a very low con- 
servation of synthetic lethality between metabolic genes (Fig. 4b). Fol- 
lowing a common definition, we considered a pair of non-essential genes 
to be synthetic lethal if simultaneous in silico deletion of the correspond- 
ing reactions from FBA models made biomass synthesis infeasible. Even 
at close genetic distances (<0.01 in Fig. 4b) synthetic lethality is con- 
served, on average, for only ~30% of orthologous metabolic gene pairs. 
At close distances there is also a substantial variability in the conserva- 
tion of synthetic lethality across species. As bacterial species diverge fur- 
ther, the average conservation of synthetic lethality drops to ~5%. This 
suggests that synthetic lethality is much more sensitive to changes in 
microbial genotypes than gene essentiality and metabolic growth phe- 
notypes. Only several comprehensive studies, none of them in bacteria, 
have been performed to assess experimentally the conservation of genetic 
interactions and synthetic lethality’”"*. Comparison of fitness data from 
budding and fission yeast revealed a conservation of epistatic gene pairs 
of ~29% (ref. 17) (corresponding to ~17% similarity). On the other 
hand, ~5% of the orthologues of synthetic lethal gene pairs in yeast were 
also found to be synthetic lethal in Caenorhabditis elegans’* (~2.5% 
similarity). Although these data were obtained in eukaryotic species 
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similarity in synthetic sick/synthetic lethality for two pairs of eukaryotic species 
(see Methods). 
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and the FBA accuracy for predictions of genetic interactions is lower 
than for essentiality or growth phenotypes”, the available experimental 
results (Fig. 4b, black dots) are generally consistent with the average 
divergence trend predicted in our bacterial simulations. 

We note that the observed behaviour of long-term phenotypic diver- 
gence is somewhat reminiscent of the molecular clock in protein evolution’. 
Similar to protein evolution, it is likely that the phenotypic divergence 
trends are due to both bacterial adaptation to diverse environmental 
niches and neutral changes”. The relative contribution of adaptive and 
neutral changes is likely to be different in each particular linage and evo- 
lutionary context. Our analysis shows that growth phenotypes, gene 
essentiality, and synthetic lethality diverge with different rates and have 
different sensitivities to bacterial genotypes. It is likely that many other 
phenotypic properties, such as the ability to synthesize different com- 
pounds, interact with other species, or withstand specific environmental 
perturbations, will also show distinct evolutionary patterns. We believe 
that the accelerating pace of genomic and metagenomic sequencing, 
and continuous improvement in computational annotation methods”, 
will soon allow mapping of the evolution of various phenotypic prop- 
erties across the entire bacterial tree of life. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Metabolic models. We obtained 322 genome-scale metabolic models using a recently 
published protocol for automatic network reconstruction’. To minimize possible 
biases due to computational gap-filling and network auto-completion, we only 
considered models in which more than 80% of the reactions were directly based on 
available gene annotations. To prevent biases related to uneven sampling of bac- 
terial phylogenic space, we did not use models from the order Enterobacteriales, 
which contains a significantly higher number of sequenced genomes compared with 
other bacterial linages. The exact identity of the considered species did not have a 
significant impact on the predicted average trends (Extended Data Fig. 3). 
Computation of genetic distances. For the considered bacteria, 16S rRNA gene 
sequences were obtained from GenBank”. Sequences were aligned using Clustal 
Omega” to a reference alignment of small subunit rRNA sequences from the SILVA 
database*. Genetic distances were then calculated on the basis of the multiple 
sequence alignment using the Dnadist program in the Phylip software package”; 
the F84 model of nucleotide substitution, with default parameter values, was used. 
The cladograms in Fig. 1 and Extended Data Fig. 4 were computed from the distance 
matrix using the Fitch program in Phylip with default parameters; the M. barkeri 
rRNA was used as the outgroup sequence. The taxonomic classification used in 
Fig. 3 was obtained from the National Center for Biotechnology Information tax- 
onomy database”*. 

Prediction of growth phenotypes. FBA allows one to determine feasible values 
of metabolic reaction fluxes subject to reaction stoichiometry constraints and the 
assumption of metabolic steady state’. Additional constraints, such as upper and 
lower bounds for metabolic fluxes or flux combinations, can be applied to the models. 
All FBA calculations in our manuscript used the COBRA toolbox’’. We used the 
following procedure to determine the ability of each species to utilize the considered 
nutrient sources. First, we identified a set of compounds required by the models to 
simulate growth on different carbon or nitrogen sources (Supplementary Data 2); 
these compounds include various vitamins, nucleotides and amino acids* as well as 
several model-specific requirements (see below). To test the ability of each model 
to use different carbon sources, an in silico growth medium was defined where the 
aforementioned substances were constrained to a maximum combined uptake of 
10 mmol of carbon per gram dry weight, and all other carbon compounds were 
removed. All carbon-free compounds were made available in the simulated med- 
ium (with the maximum uptake rate of | mol g”' dry weight). An analogous pro- 
cedure was used to define a growth medium for testing nitrogen sources. Second, 
for each considered carbon or nitrogen source (compound), we used FBA to cal- 
culate the maximum biomass or ATP synthesis rate when the compound was made 
available in the corresponding in silico medium (maximum uptake rate of 1 mol g ! 
dry weight). Similar to the treatment of experimental data (see experimental pro- 
cedures below), biomass or ATP flux values were normalized on a scale of 0-100 
corresponding to the minimum (no carbon or nitrogen source tested) and maxi- 
mum flux values across considered compounds, respectively. Similar to experimental 
measurements, carbon and nitrogen sources scoring 10 or above were considered 
as being positive for growth. The metabolic compounds tested in our analysis cor- 
respond to carbon and nitrogen sources that are commonly used by multiple bac- 
teria (Extended Data Table 3) and are assayed in Biolog MicroPlates*. To prevent 
low phenotypic similarities arising because of models with a very low overall number 
of positive growth phenotypes, we only considered models that could synthesize 
biomass on more than five of the tested compounds; the exact value of this cutoff 
had little effect on the observed average trends (Extended Data Fig. 5a). In total, 
229 and 272 models were used for the analysis of carbon and nitrogen sources for 
biomass synthesis, respectively. 

Definition of an in silico growth medium. Similar to bacterial growth in vivo, 
many of the metabolic models used in our analysis are auxotrophic for specific com- 
pounds, beyond the main carbon and nitrogen sources tested for their ability to 
support microbial growth (62 carbon and 68 nitrogen compounds); that is, the 
models can simulate biomass or ATP synthesis only if small amounts of additional 
nutrients are available in the simulated growth media. To define a single minimal 
medium, used across all models to test growth on the main carbon sources, we 
used the following procedure. First, in addition to the main carbon sources (avail- 
able with a maximum uptake of 1 mol g”' dry weight), all metabolic compounds 
that could be imported by the models were made available in the simulated media 
with a maximum combined carbon uptake of 10 mmol g~' dry weight; we note 
that this maximum uptake rate is only 1% of the maximum uptake of the main 
carbon sources. Second, we determined which of the main carbon sources could 
support growth under these conditions. Third, for each model the additional carbon 
compounds were sequentially removed (in a random order) from the simulated 
media until no compound could be further removed while allowing growth on the 
main carbon sources determined in the second step. Fourth, the additional carbon 
sources required for growth in more than 75% of the tests with a positive growth 
phenotype were combined across all models; this resulted in the carbon-containing 
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component of the minimal media. Fifth, the same procedure was used to determine 
the nitrogen-containing component of the minimal media. Sixth, the carbon- and 
nitrogen-containing components were combined to produce the minimal media 
used in the study (Supplementary Data 2). Notably, very similar results were observed 
using different minimal media obtained from independent runs of the aforemen- 
tioned procedure. Very similar results (Extended Data Fig. 6) were also obtained 
when the in silico growth medium, used for all tests, contained all possible nutrients 
(without any removals) with maximum uptakes rates of 1 mol g' dry weight for 
the main carbon or nitrogen sources, and with combined maximum uptake rates of 
10 mmol g’* dry weight (1% of the uptake for the main nutrients) for additional 
carbon or nitrogen compounds, respectively. 

Prediction of essentiality and synthetic lethality. To determine essential genes, 
we first established the association between every gene and corresponding meta- 
bolic reactions. We then simulated gene deletions by setting the maximal fluxes 
through corresponding reactions that cannot be catalysed by the products of other 
genes to zero. If such an in silico deletion of a gene made it impossible, on the basis 
of FBA calculations, to produce a non-zero biomass, the gene was considered to 
be essential. A pair of non-essential genes was considered to be synthetic lethal if 
simultaneous deletion of the two genes made it impossible to produce a non-zero 
biomass. All FBA simulations for testing gene essentiality and synthetic lethality 
were performed, similar to common experimental procedures, using an in silico 
rich medium: that is, non-zero fluxes were allowed through every transport reaction 
in the models. Genes associated with lumped reactions, such as ‘protein synthesis’, 
were not considered in the calculations. To prevent low phenotypic similarities 
arising because of models with a very small number of essential genes, only models 
with more than ten predicted essential genes or synthetic lethal gene pairs with 
mapped orthologues were considered in the analysis; 314 and 290 models were used 
for the analysis of essentiality and synthetic lethality, respectively. 

Quantifying phenotypic similarity. For a given set of features, namely carbon 
sources, essential genes, or synthetic lethal gene pairs, similarities between species 
were quantified by Jaccard’s similarity index. Jaccard’s index is defined as the size 
of the intersection between two given sets divided by the size of the union of the 
two sets; for example, if A is the set of all carbon sources that can be used by species 
a, and B the set that can be used by species b, then Jaccard’s carbon source simi- 
larity between a and b is defined as J(A,B) = |A N B|/|A U B|. Importantly, to cal- 
culate the similarity of gene essentiality and synthetic lethality between species, we 
only considered orthologous genes and gene pairs that are shared between corre- 
sponding metabolic networks. Orthologous genes were identified using bi-directional 
BLASTP” hits (with expect (E) values <0.05) between the species’ genomes. 
Analysis of experimental Biolog data. A collection of 40 microbial species span- 
ning a wide range of phylogenetic distances (Extended Data Fig. 4) was used to con- 
firm the computationally predicted trends. The ability of these species to metabolize 
the 62 carbon sources used in the computational analysis was determined using 
Biolog GENIII Phenotype Microarrays’; all data were obtained directly from the 
Biolog GEN III database (Biolog). The Phenotype Microarrays technology is based 
on the reduction of a tetrazolium dye, which allows determination of the usage of 
different nutrient sources across multiple growth conditions*. Biolog assays were 
performed essentially as described in the GEN III MicroPlate instruction manual. 
Colorimetric measurements after a 24 h incubation period for each species and each 
carbon source were expressed on a scale of 0-100, representing the average colour 
density (across at least five biological replicates) in each well of the Biolog plate 
relative to the negative and positive controls; only scores of 10 or above were con- 
sidered as evidence that a tested compound was used by a species. This cutoff value 
was obtained on the basis of the bimodal-like distribution of the data’? (Extended 
Data Fig. 5b); similar results were obtained using other cutoff values (Extended 
Data Fig. 5c). The accuracy of FBA in predicting microbial growth phenotypes was 
evaluated using the nine species that were present both in the computational and 
experimental analyses. The experimental values in Fig. 3 (red stars) were based on 
data from the aforementioned 40 species, and the intra-species similarity studies in 
refs 12 and 13. 

Experimental gene essentiality data. Gene essentiality data were compiled for 19 
species with genome-wide gene deletion screens (Extended Data Table 4). Species 
from the genus Mycoplasma were excluded from this analysis because of their very 
small genomes and a very high (~80% (ref. 29)) fraction of essential genes. For every 
pair of species, orthologous genes were identified using bi-directional BLASTP hits 
(with E values <0.05). The similarity of gene essentiality was determined for genes 
with identified orthologues annotated as enzymes in the KEGG database*’. To 
estimate the similarity of gene essentiality at close genetic distances, we also con- 
sidered partial essentiality data for Streptococcus pneumoniae R6 and Staphylococcus 
aureus N315, which were compared with essential genes in Streptococcus sanguinis 
SK36 and S. aureus NCTC 8325, respectively. To use these incomplete data, and 
given the similarity of the species’ genome sizes, we assumed symmetry of essential 
gene conservation: that is, that the number of essential genes in one species that are 
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not essential in the other is the same for both bacteria. A similar approach was used 
to estimate Jaccard’s similarity of genetic interactions between eukaryotic species 
on the basis of published data'”’® (Fig. 4b). 

Exponential model fits. Pairwise divergence of bacterial phenotypic similarity (y) 
as a function of genetic distance (t) was fitted using the following equation: y = 
a+ be “, where the parameter a represents the saturation level for phenotypic 
divergence at long genetic distances, (a + b) represents the level phenotypic sim- 
ilarity at close genetic distances, and the parameter c quantifies the divergence rate, 
namely the phenotypic similarity decrease per unit of genetic distance (time). Larger 
values of c correspond to faster divergence of the phenotypic similarity. The para- 
meter a was not considered in the nested two-parameter exponential model used for 
model comparison. To quantify the genetic distance between bacterial species, we 
used the divergence between their 16S rRNA sequences; 1% 16S rRNA distance 
approximately corresponds to 50 million years since divergence from a common 
ancestor". 
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Extended Data Figure 1 | The evolution of phenotypic similarity in the 
usage of carbon sources for ATP production. Genetic distances are based on 
16S bacterial rRNA sequences. The colours represent the point density at a 
given genetic distance for all pairwise comparisons between metabolic models 
(n = 20,910). The black line shows a three-parameter exponential fit to the 
computational predictions; the red line shows a moving average of the 
predictions. 
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Extended Data Figure 2 | The evolution of phenotypic similarity for 
different sets of carbon sources. Genetic distances are based on 16S bacterial 
rRNA sequences. Phenotypic similarities of biomass synthesis are shown for 62, 
149, and 446 carbon sources. Solid lines represent moving averages (using a 
0.05 genetic distance window) of computational predictions; dashed lines 
represent exponential fits to the data. 
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Extended Data Figure 3 | The effect of species selection on observed fraction of auto-completion reactions are considered, (b) models from 
patterns of phenotypic divergence. The black lines in all panels (marked individual families that include more than ten modelled species are excluded 


“Original fit’) represent the exponential fit of the phenotypic similarity (carbon _ from the analysis, (c) only a maximum number of species per family is 
source utilization) as a function of genetic distance for all pairs of considered _ considered, and (d) a subset of species is chosen at random from the pool of all 


models, that is, models with fewer than 20% auto-completion reactions and considered models. In d, the average values at different genetic distance bins are 
more than five predicted carbon sources for growth. The observed trends of shown for 1,000 random samples of a given number of species; error bars 
phenotypic evolution remain similar when (a) only models with a smaller represent the s.e.m. obtained on the basis of the 1,000 replicates. 
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Extended Data Figure 4 | Evolutionary relationship between 40 species for which experimental phenotype microarray data were considered. The cladogram 
is based on 16S bacterial rRNA sequences. Different colours indicate different bacterial classes. The tree was rooted in the M. barkeri rRNA sequence. 
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Extended Data Figure 5 | The effect of cutoff selection on the computational _0 are not plotted). The dashed red line shows the cutoff score of 10 used in 


and experimental phenotypic similarity trends. a, The exponential fits of the main analysis. c, The effect of different cutoffs used to define positive growth 
phenotypic similarity as a function of genetic distance for metabolic models in the Biolog data. Different lines represent exponential fits to the experimental 
that predict growth on more than a given number of carbon sources. b, The values of phenotypic similarity based on different values of the Biolog 
frequency distribution of normalized Biolog scores for 40 species across 62 cutoff score. 


experimental growth conditions. The figure shows a bimodal pattern (scores of 
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Extended Data Figure 6 | Effect of growth media on the predicted 
phenotypic similarity trends. The black line shows an exponential fit to the 
predicted phenotypic similarity in the usage of carbon sources calculated on the 
basis of the in silico minimal media used in the study of growth phenotypes 
(see Methods and Supplementary Data 2) (n = 26,106). The dashed red line 
shows an exponential fit to the predicted phenotypic similarity using the 

in silico medium in which all carbon sources that could be imported by the 
models were made available, with the total combined uptake of carbon 
constrained to a maximum value of 10mM g_' dry weight (n = 27,261). 
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Extended Data Table 1 | Parameters of the exponential divergence models, describing the evolution of growth and genetic phenotypes 


Phenotype a b c 

Carbon source (biomass) 0.21 0.50 8.16 
Carbon source (ATP) 0.21 0.50 7.86 
Nitrogen source (biomass) 0.25 0.48 9.75 
Essentiality 0.46 0.24 13.6 
Synthetic lethality 0.037 0.27 9.98 


Values in the table show the parameters (a, b,c) of the divergence model y =a + be “ 


, where y represents the phenotypic similarity and t represents the genetic distance between species. Parameter a represents 


the saturation level of phenotypic divergence at long genetic distances, (a + b) represents the level phenotypic similarity at close genetic distances, and the parameter c quantifies the divergence rate. 
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Extended Data Table 2 | Model comparisons for predicted and experimentally determined phenotypic similarity as a function of genetic 
distance 


Phenotype 3-parameter exponential vs. linear model 3-parameter exponential vs. 2-parameter 
(relative likelinood based on Akaike weights) exponential model (F-test P-value) 
FBA carbon source similarity (biomass) >10"° < 107° 
Experimental carbon source similarity (Biolog) 9*10° 4.6*10° 
FBA essentiality similarity >10° < 10% 
Experimental essentiality similarity 27.2 8.5*10° 
FBA Synthetic lethality similarity >10” <10” 


Comparisons between the three parameter exponential model and the linear model were performed on the basis of Akaike’s Information Criterion (AIC). Values in the table represent the Akaike-based relative 
likelihoods of the three-parameter exponential model compared with the linear model. Comparisons between the three-parameter exponential model and the nested two-parameter exponential model were 
performed using the F-test; the corresponding P values reflect the probability that the nested two-parameter model fits the data as well as the more complex three-parameter model. 
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Extended Data Table 3 | The predicted frequency of carbon and nitrogen source usage across metabolic models 


Nitrogen sources for biomass production 


Metabolite name 


Number of models 


L-Glutamic Acid 
a-D-Glucose 
D-Fructose 

L-Malic Acid 

L-Lactic Acid 

Maltose 

Glycerol 

L-Serine 

L-Aspartic Acid 
D-Mannose 
L-Arginine 
N-Acetyl-DGlucosamine 
D-Trehalose 

Sucrose 

Inosine 
a-Keto-GlutaricAcid 
L-Histidine 

L-Alanine 
D-GlucuronicAcid 
D-Serine 

a-D-Lactose 

Formic Acid 
Acetoacetic Acid 
D-Cellobiose 
D-Mannitol 

D-Malic Acid 
D-Sorbitol 
D-GalacturonicAcid 
Mucic Acid 
D-Saccharic Acid 
D-Gluconic Acid 
D-Galactose 

Citric Acid 
D-Raffinose 

Dextrin 
g-Amino-ButyricAcid 
L-Rhamnose 

Salicin 
N-Acetyl-b-D-Mannosamine 
Glucose-6-Phosphate 
D-Melibiose 
D-Aspartic Acid 
L-Fucose 

M-Inositol 

Quinic Acid 

Propionic Acid 
Stachyose 
N-Acetyl-Neuraminic Acid 
D-Arabitol 

Acetic Acid 
Fructose-6-Phosphate 
N-Acetyl-DGalactosamine 
a-Keto-ButyricAcid 
Gentiobiose 
b-Methyl-D-Glucoside 
D-Fucose 

Gelatin 
L-PyroglutamicAcid 
Pectin 
L-GalactonicAcid-g-Lactone 
P-HydroxyPhenyl AceticAcid 
a-HydroxyButyric Acid 
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Ammonia 

Urea 

L-Proline 
L-Glutamic Acid 
L-Valine 
L-lsoleucine 
L-Leucine 
Nitrate 

Nitrite 
L-Glutamine 
L-Serine 
L-Aspartic Acid 
Cytosine 
L-Arginine 
L-Ornithine 
Uracil 


N-Acetyl-D-Glucosamine 


Cytidine 
Adenosine 
Glycine 
L-Methionine 
Xanthine 
Histamine 
L-Histidine 
L-Tryptophan 
L-Alanine 
Ethanolamine 
D-Alanine 
D-Serine 
L-Lysine 
D-Glucosamine 
Putrescine 
Acetamide 
L-Phenylalanine 
Allantoin 
L-Tyrosine 
Formamide 
Inosine 
L-Cysteine 
L-Asparagine 
Thymidine 
Uridine 
Guanine 
Guanosine 


N-Acetyl-D-Mannosamine 


Xanthosine 
D-Aspartic Acid 
Methylamine 
Thymine 
L-Threonine 
D-Lysine 
Adenine 


N-Acetyl-D-Galactosamine 


L-Citrulline 
D-Galactosamine 
D-Glutamic Acid 
Tyramine 
b-Phenylethylamine 
L-Homoserine 

Uric Acid 

Agmatine 
Hydroxylamine 
Ethylamine 
L-Pyroglutamic Acid 
D-Asparagine 
D-Mannosamine 
D-Valine 

Biuret 


241 
241 
226 
203 
203 
183 
178 
155 
145 
136 
127 
124 


DODDDOOOHHNNNWWA HO 


Numbers in the table represent the total number of models, out of 322, predicted to use the corresponding carbon or nitrogen source. Metabolites are ranked from most to least frequent across models. 
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Extended Data Table 4 | Bacteria with experimental genome-wide data used to analyse the conservation of gene essentiality 


Species name Number of essential genes _ Reference PubMed ID 
Acinetobacter baylyi ADP 1 499 18319726 
Bacillus subtilis subsp. subtilis str. 168 276 14602916, 12682299 
Bacteroidetes thetaiotaomicron VP|-5482 325 19748469 
Burkholderia thailandensis E264 406 23382856 
Caulobacter crescentus NA1000 480 21878915 
Escherichia coli K-12 302 16738554 
Francisella novicida U112 396 17215359 
Haemophilus influenzae Rd KW20 667 11805338 
Helicobacter pylori 26695 336 15547264 
Mycobacterium tuberculosis H37Rv 689 23028335 
Porphyromonas gingivalis ATCC 33277 463 23114059 
Pseudomonas aeruginosa PAO1 774 14617778 
Salmonella enterica subsp. enterica serovar Typhi str. Ty2 331 23470992 
Salmonella enterica serovar Typhimurium str. SL1344 355 23470992 
Shewanella oneidensis MR-1 403 22125499 
Sphingomonas wittichii RW1 572 23601288 
Staphylococcus aureus subsp. aureus N315 168* 11952893 
Staphylococcus aureus subsp. aureus NCTC 8325 351 19570206 
Streptococcus pneumoniae 134+ 15995353 
Streptococcus sanguinis 218 22355642 
Vibrio cholerae O1 biovar El Tor str. N16961 344 23901011 


* Only incomplete data available, used to estimate conservation relative to S. aureus NCTC 8325. 
+ Only incomplete data available, used to estimate conservation relative to S. sanguinis. 
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The mammalian taste system is responsible for sensing and respond- 
ing to the five basic taste qualities: sweet, sour, bitter, salty and umami. 
Previously, we showed that each taste is detected by dedicated taste 
receptor cells (TRCs) on the tongue and palate epithelium’. To under- 
stand how TRCs transmit information to higher neural centres, we 
examined the tuning properties of large ensembles of neurons in 
the first neural station of the gustatory system. Here, we generated 
and characterized a collection of transgenic mice expressing a genet- 
ically encoded calcium indicator’ in central and peripheral neurons, 
and used a gradient refractive index microendoscope’ combined with 
high-resolution two-photon microscopy to image taste responses from 
ganglion neurons buried deep at the base of the brain. Our results 
reveal fine selectivity in the taste preference of ganglion neurons; dem- 
onstrate a strong match between TRCs in the tongue and the principal 
neural afferents relaying taste information to the brain; and expose 
the highly specific transfer of taste information between taste cells 
and the central nervous system. 

In mammals, taste receptor cells are assembled into taste buds that 
are distributed in different papillae in the tongue epithelium. Taste buds 
are innervated by afferent fibres that transmit information to the prim- 
ary taste cortex through synapses in the brainstem and thalamus‘. In the 
simplest model of taste coding at the periphery, each quality, encoded 
by a unique population of TRCs expressing specific receptors (for ex- 
ample, sweet cells, bitter cells, and so on), would connect to a matching 
set of ganglion neurons. Notably, although TRCs are tuned to preferred 
taste qualities> *, the nature of their functional ‘handshake’ with the ner- 
vous system has been a matter of significant debate’**"°. 

We reasoned that this fundamental question could now be resolved 
by directly examining the tuning properties of taste ganglion neurons. 
We focused on the geniculate ganglion, as its neurons innervate all taste 
buds in the front of the tongue and palate’, and opted to use two-photon 
calcium imaging to monitor tastant-evoked neural activity in vivo. This 
strategy, however, required the solution of two technical challenges: 
first, the ganglion is located in a bony capsule under the brain, some 
4mm from the surface, far beyond the reach of conventional micro- 
scopy; and second, geniculate ganglion neurons would have to be loaded 
with sensors of neuronal activity that can report function with good 
temporal, spatial and dynamic range. To solve the first challenge, we im- 
plemented the use of two-photon microendoscopy, where a gradient 
refractive index (GRIN) lens is used as an optical extension device*. The 
GCaMP family of genetically encoded calcium sensors are an attractive 
tool to solve the challenge of indicator loading’, yet there were no suit- 
able mouse lines or drivers appropriate for targeting geniculate ganglion 
neurons. Therefore, we generated a collection of 40 mouse lines expres- 
sing GCaMP3? driven by Thy1 (Fig. 1a), a neuronal promoter highly 
sensitive to position effects'', and screened for those that express the 
sensor in most geniculate ganglion neurons (Fig. 1b). Line 1 had essen- 
tially complete labelling of geniculate ganglion neurons (Fig. 1c), and 
stimulation of the ganglia ex vivo produced reliable calcium-dependent 


fluorescence changes (Fig. 1d). To take advantage of the more recent 
versions of GCaMP’"’, we subsequently developed a viral infection ap- 
proach that efficiently labels geniculate ganglion neurons via retrograde 
transfer of the virus from their terminal fields in the nucleus of the sol- 
itary tract (see Methods for details). 

To visualize geniculate ganglion neurons in live mice, we exposed a 
small ventral window into the ganglion (Fig. 2a), and carefully positioned 
a GRIN lens (1 mm diameter X 3.8 mm length) directly onto the tissue 
(Fig. 2b). This two-photon imaging configuration allowed unencum- 
bered access to the entire ganglion, and enabled the investigation of the 
geniculate ganglion at sufficient numerical aperture (~0.45 NA) so as 
to detect GCaMP-dependent fluorescence changes (Fig. 2c, d). 
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Figure 1 | Thy1-GCaMP3 transgenic mice express functional GCaMP3 in 
taste ganglion cells. a, Structure of the Thy1.2-GCaMP3 construct''. b, Whole- 
mount confocal images of geniculate ganglion from eight transgenic lines 
(top right) shows GCaMP3 expression in varying subsets of neurons. ¢, Line 1, 
used in our studies, expresses GCaMP3 in nearly all neurons (>90%, n = 6); 
compare Nissl staining (red) versus GCaMP3 fluorescence (green). d, Ex vivo 
calcium imaging of a geniculate from line 1 illustrating strong GCaMP3 
responses to a test depolarizing solution (KCI, 500 mM); over 75% of 

imaged neurons responded with AF/F greater than 100% (n = 25 cells, 

mean + quartiles). Scale bars, 100 um. 
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Figure 2 | In vivo two-photon microendoscopy of the geniculate ganglion. 
a, Diagram illustrating optical access to the geniculate ganglion. A 1-mm 
GRIN microendoscope’ was guided into the surgical opening, and imaged 
using two-photon microscopy. b, Bright-field image through the 
microendoscope (left), showing individual GCaMP-labelled neurons (right). 
c, Images of a ganglion with 25 GCaMP3 labelled neurons responding to 
sweet (acesulfame K, 30 mM), bitter (quinine, 5 mM), sour (citric acid, 50 mM) 
and salt (NaCl, 60 mM) tastants. Fluorescence amplitudes were pseudo- 
coloured according to AF/F (scale at right). d, Traces from six separate 
neurons (numbered in c) illustrating the time course of amplitude changes in 
GCaMP3 signals after sour stimulation. Horizontal bars mark the time and 
duration of the stimulus (inter-stimulus interval was 8 s). Scale bars: a, 4mm; 
b, 200 um and 50 um (magnification). 


Weassessed the responses of geniculate ganglion neurons to tastants 
using a range of stimulus paradigms that included the five basic taste 
qualities at concentrations that evoke strong behavioural and nerve re- 
sponses® (Extended Data Fig. 1). For most recordings the tongue was 
exposed to a 6.5-s pre-stimulus application of artificial saliva, a 2-s ex- 
posure to a test tastant, and a 6.5-s artificial saliva post-stimulus wash. 
Each session included a minimum of four trials per tastant, and a neu- 
ron was classified as a responder if it exhibited statistically significant 
responses in at least 50% of the trials (see Methods and Extended Data 
Table 1 for details). 

In the tongue, sweet and umami tastes are mediated by a small fam- 
ily of three G-protein-coupled receptors (GPCRs) that combine to form 
two heteromeric receptors: TIR1 and T1R3 for umami*"*” and T1R2 
and T1R3 for sweet*”*. Bitters are detected by a family of approximately 
three-dozen GPCRs, the T2Rs'°, and sour and sodium taste are sensed 
by ion channel receptors”®. Given a palette of five different tastes, and 
all possible neuronal tuning combinations, there are a total of 31 poten- 
tial neuronal classes: for example, five of these would be tuned to single 
taste qualities (one for each of the five basic tastes), ten doubly tuned, 
ten classes responding to three tastes, five to four tastes and a single class 
tuned to all five tastes. This raises the question of how taste is represented 
in the ganglion. 

We focused on a standard set of stimuli representing the five basic 
qualities: sweet-responding neurons were identified using sucrose, bit- 
ter with either quinine or cycloheximide, salt by stimulating with NaCl, 
sour with citric acid, and umami with monopotassium glutamate plus 
inosine monophosphate. Tastant stimulation of the tongue elicited strong 
calcium transients in subsets of ganglion neurons (Fig. 2c, d). The re- 
sponses were robust, reliable (Extended Data Fig. 2) and specific (Fig. 3 
and Extended Data Fig. 3). For example, salt responses showed the ex- 
pected blockage by amiloride’ (Fig. 3c and Extended Data Fig. 3e), bitter 
responses were blocked by allyl isothiocyanate (AITC)"’ (Fig. 3d and 
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Extended Data Fig. 3f), sweet/umami responses were abolished in knock- 
outs of the T1R3 receptor subunit® (Fig. 3e), and acid responses displayed 
the appropriate sensitivity to PKD2L1-cell silencing” (Extended Data 
Fig. 4). We examined the reproducibility of the taste-evoked responses 
by measuring variability across trials and, on average, cells responded 
in at least 75% of the trials (Extended Data Fig. 2). We analysed nearly 
1,000 neurons exhibiting tastant-evoked activity: 443 of these were derived 
from 15 Thyl-GCaMP3 mice and 432 from 14 AAV-GCaMP6 animals 
(see Extended Data Table 1); we also included approximately 50 cells 
each from AAV-GCaMP3 and AAV-GCaMP5 pilot experiments. Our 
data demonstrate that the vast majority of the responding neurons are 
strongly activated by only one taste quality, and thus exhibit highly 
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Figure 3 | Selective responses of geniculate ganglion neurons. a, Responses 
of ten ganglion neurons to the five basic taste qualities. Black traces mark 
average AF/F from four individual trials per tastant (grey traces); horizontal 
bars mark the time and duration of the stimulus. b, Individual ganglion neurons 
exhibit strong taste preferences. The graph shows a rank-ordered plot of 
calcium transient amplitudes for 152 sour-responsive neurons. For each cell, 
the mean sour response amplitudes (red) and the mean amplitude of its next- 
strongest tastant response (grey) are shown; minor dots indicate individual trial 
amplitudes. The vast majority of these sour cells are strongly tuned to sour 
taste versus any other taste quality. c, d, Responses are highly selective. The 
ENaC inhibitor amiloride*® blocks NaCl responses (c), while the bitter TRC 
inactivator AITC” abolishes bitter responses (d); individual traces (grey) 

and average traces (black) are shown for two representative ganglion neurons 
before (—) and after (+) pharmacological application of the blockers to the 
tongue. e, Mice lacking T1R3 lack ganglion responses to sweet and umami 
stimuli. Black bars denote control animals (n = 8) and red bars T1r3 
knockouts (n = 9). 
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Figure 4 | Representation of taste in the primary sensory ganglia. 

a, Response profile of 904 ganglion neurons from 37 mice according to 
taste preference; singly tuned cells (dark grey; see b for breakdown), bitter- 
sour cells (grey) and broadly tuned cells (light grey; see Extended Data 
Table 2). Importantly, the bitter-sour class actually reflects the activity of 


preferred and narrowly tuned tastant selectivity (Figs 3 and 4). Further- 
more, the representation of the various taste classes within the ganglion 
appears random, with no obvious topography or clustering (Extended 
Data Fig. 5). 

The finding that most ganglion neurons are tuned to single taste qual- 
ities substantiates a simple match between TRCs and ganglion neurons, 
with information from the tongue propagated primarily along labelled 
lines. This is inconsistent with models proposing a general across-fibre 
model of coding*, or that receptor TRCs first signal their information 
to broadly tuned presynaptic cells, which would then activate ganglion 
neurons”'®. We find no evidence or support for either of these models, 
as they would require that most of the ganglion neurons respond across 
multiple taste qualities. 

A key prediction of our findings is the expectation that responses to 
taste mixes should largely reflect the sum of the responses to the indi- 
vidual tastants (that is, as dedicated lines operating independently of each 
other). Therefore, we tested the representation of binary mixes relative 
to the responses to the individual components. We chose three pair sets 
including sweet, bitter, sour and salt tastes and examined their responses. 
Our results (Extended Data Fig. 6) demonstrate that taste mixes indeed 
behave like the simple addition of the responses to the individual tastants. 

This study also illustrates and uncovers three additional aspects to 
the representation of taste in the first neural station. First, it reveals that 
although there are 26 possible combinations of ‘multi-tuned’ responses 
(10 doubles, 10 triples, 5 quads and 1 that may potentially respond to all 
five classes of taste), only small numbers of bona fide multi-tuned neu- 
rons are found at over 1% (see Extended Data Table 2). Second, half of 
the apparently multi-tuned ganglion neurons belong to just one class, 
namely bitter-sour. And third, umami-responding neurons are divided 
between umami-alone and umami-sweet responders. 

We next questioned how these multi-tuned ganglions neurons might 
arise. They may be the result of signalling from two or more classes of 
TRCs converging into a single target neuron, or they may receive inputs 
from just one TRC type, but appear multi-tuned due to the nature of the 
stimulus. Indeed, it has long been known that some tastants, although 
single molecular species, may activate more than one class of TRCs. This 
is well exemplified by saccharin which powerfully activates sweet TRCs 
at low concentrations, but begins to activate bitter TRCs at high con- 
centrations (thought to be the basis for the bitter aftertaste of saccharin); 
another recently reported example is the activation of bitter cells by po- 
tassium salts (KCl) at high concentrations’’. Thus, we wondered whether 
the activation of bitter-sour ganglion neurons by acid (representing by 
far the largest class of apparently multi-tuned neurons; Fig. 4c) may 
also reflect tastant cross-talk, for instance a subset of T2R receptors in 
bitter cells being sensitive to low pH, as has been observed for a num- 
ber of GPCRs”. 

We reasoned that by using a combination of genetic manipulation 
and pharmacology it should be possible to explain the origin of these 
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T2R-expressing (bitter) TRCs (see text for details). We note that acid responses 
from bitter cells are not readily visible in whole-nerve recordings”"’, probably 
reflecting sensitivity differences between single-cell imaging and ‘bulk’ 
extracellular recordings. b, c, Distribution of ganglion neurons according to 
tastant selectivity. 


responses. For example, if bitter-sour neurons receive convergent input 
from T2R and PKD2LI cells, then ablating or silencing the PKD2L1- 
expressing sour TRCs should eliminate the bitter-sour doubly tuned 
class (that is, by eliminating their sour responses). In contrast, if these 
neurons reflect bitter cells which are sensitive to acidic stimuli, such ge- 
netic manipulation should have no impact on the activity or numbers 
of bitter-sour-responding ganglion neurons. We imaged mice in which 
we silenced synaptic transmission from PKD2L1-expressing cells by ge- 
netically targeting tetanus toxin’” (PKD2L1-tetanus). As expected the 
sour-alone neuronal responses were abolished, but, notably, the bitter- 
sour class was preserved (Extended Data Fig. 4). These results demonstrate 
that bitter-sour ganglion neurons do not receive input from sour-sensing 
TRCs, and strongly argue that the bitter-sour neurons are the result of 
T2R-expressing bitter TRCs being sensitive to acid stimuli. Iftrue, then 
blocking bitter taste responses should now eliminate the bitter—sour class. 
As predicted, Extended Data Fig. 4 shows this to be the case, further 
demonstrating that the vast majority of all ganglion neurons do receive 
input from single classes of TRCs. 

A surprise that emerged from this study was the finding that there were 
very few umami ganglion neurons, with many also activated by sweet 
tastants (Fig. 4). Sweet and umami taste receptors are formed by the com- 
bination of three GPCRs from the same gene family*'*”*. Interestingly, 
unlike humans, who can readily distinguish between sweet and umami 
tastes, rodents exhibit considerable behavioural cross-generalization bet- 
ween these two appetitive taste qualities’, even though they still repre- 
sent sweet and umami in two distinct but closely apposed cortical fields”. 
We suggest that this cross-generalization results in part from convergence 
of sweet and umami inputs into common ganglion neurons, perhaps as 
a consequence of their shared origin, and receptor repertoire. Future 
studies examining the fate, tuning and connectivity of individual TRCs 
throughout their life cycle should help resolve this question. 

The capacity to examine the activity of large ensembles of neurons 
with single-cell selectivity, even in a tissue buried deep in the head, has 
afforded a comprehensive examination of the behaviour of gustatory 
neurons in coding responses to the basic taste qualities with exquisite 
resolution. The development of this imaging preparation has addressed 
a long-standing question in the taste field, namely the direct demon- 
stration of labelled lines between TRCs and the brain. It should also 
provide a powerful experimental platform to identify molecular markers 
defining each class of taste ganglion neurons, help dissect the nature of 
their ‘handshake’ with the tongue and the brainstem, and explore the 
constancy of taste perception in light of the remarkably rapid turnover 
of TRCs”. 

Finally, we note that ganglion neurons not only convey taste quality, 
but also valence. In this regard, it would be of interest to determine if some 
of the neurons may be dedicated to carry attractive and aversive signals 
(that is, valence) rather than tastant ‘identity’ and thereby delineate the 
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circuit that mediates immediate taste ingestion or rejection responses, 
independent of conscious perception and the cortex”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Transgenic animals and mouse strains. All procedures were carried out in accor- 
dance with the US National Institutes of Health (NIH) guidelines for the care and 
use of laboratory animals, and were approved by the Columbia University, Janelia 
Farm Research Campus, or National Institute of Dental and Craniofacial Research 
Animal Care and Use Committees. Reported data were obtained from mice ran- 
ging from 2-12 months of age and from both genders; randomization and blinding 
methods were not used. To generate Thyl-GCaMP3 mice, complementary DNA 
encoding GCaMP3 was cloned into the Xhol site ofa standard Thy! vector'!. More 
than 40 lines were generated by pronuclear injection with about 25 expressing sig- 
nificant GCaMP3 in subsets of neurons in the cranial ganglia and/or brain (Fig. 1). 
Calcium-imaging experiments reported here used line 1, which robustly expresses 
GCaMP3 in all geniculate ganglion neurons. All other mouse strains have been de- 
scribed previously*’’. Sample sizes were chosen to allow robust statistical analysis 
of data, no statistical method was used to determine sample size. 

Confocal imaging of isolated geniculate ganglia. Mice were perfused and geni- 
culate ganglia were exposed by removing the brain and bone dorsal to the ganglia as 
described previously”. The greater superficial petrosal and facial nerves were cut, 
and the ganglia were extracted by blunt dissection. To quantify the number of Thy1- 
GCaMP3-positive neurons per ganglia, fluorescent Nissl staining (NeuroTrace 530/ 
615, Molecular Probes) was used to label neurons, and the ratio of Nissl-positive to 
GCaMP3-positive neurons determined; greater than 90% ofall neurons are labelled 
by Thy1-GCaMP3 (n = 6 ganglia). Ganglia were mounted in Vectashield mount- 
ing medium and imaged with a Zeiss 510 confocal microscope (Zeiss X 10, 0.45 NA 
microscope objective). For control KC] stimulations, dissected ganglia were sub- 
mersed in imaging buffer (Hank’s Buffered Salt Solution with Ca** and 10 mM 
HEPES). Suture thread was used to mount the ganglia onto a coverslip within a 
custom imaging chamber. A depolarizing solution (potassium chloride, 500 mM) 
was applied to cells between washes with imaging buffer. 

Nerve recordings. Recording procedures and analysis were carried out as described 
previously*". 

Viral delivery of GCaMP sensors. An alternative strategy to express GCaMP in 
geniculate ganglion neurons involved injection of AAV constructs in the terminal 
field of taste-responsive neurons in the brainstem. Mice were anaesthetized with 
ketamine and xylazine (100 mgkg” ' and 10 mgkg" ', intraperitoneal), with subse- 
quent booster doses to maintain depth of anaesthesia. Body temperature was con- 
trolled using a closed-loop heating system. A small craniotomy (<1 mm diameter) 
centred approximately 6.5 mm dorsal to the bregma and 1.25 mm from the midline 
was performed. AAV carrying GCaMP constructs (AA V9-hSyn-GCaMP3, AAV1- 
hSyn-GCaMP6 s and AAV9-hSyn-GCaMP5G; Penn Vector Core) was delivered to 
the brain at three locations along the rostrocaudal axis: 1.25 mm/—4.0 mm (lateral 
coordinate relative to bregma/inferior coordinate relative to the dura), and —6.3 mm, 
—6.5mm or —6.7 mm (anterior coordinates relative to bregma); approximately 
200 nl was delivered per injection. After incision closure and recovery from sur- 
gery, mice were housed in their home cages for at least two weeks before imaging. 
Surgical preparation for in vivo imaging. A metal bar was affixed to the dorsal 
cranium of an anaesthetized mouse (see above) with cyanoacrylate as described 
previously*’. The mouse was positioned in a supine position, and its head rigidly 
secured using the metal bar. A tracheotomy was performed to maintain a clear air- 
way during tastant delivery to the oral cavity’. The surgical strategy used to image 
the geniculate ganglion in vivo was as previously described for rat”®. 

In vivo microendoscopy. A singlet microendoscope probe” (1 mm diameter, 0.42 
pitch, 700 jum working distance in water) was used to access the deep tissue. To mount 
the probe stably over the imaging area, melted agarose (1.5- 2.0% low-melting-point 
agarose) was applied to the exposed cavity. The microendoscope probe was care- 
fully positioned over the geniculate ganglia with forceps, using a stereoscope for 
visual guidance. The probe was held mechanically with forceps until secured by the 
hardened agarose. A conventional two-photon microscope (Prairie Technologies) 
using a Ti:Sapphire laser tuned to 920 nm was used for fluorescence excitation. A 
long-working-distance objective (Olympus X20, 0.4 NA) was used to couple light 
into the microendoscope. Images were typically acquired at 2-4 Hz (10-30 j1s pixel 
dwell time) over a ~175-350 um field of view, unless otherwise noted. We did not 
utilize multiple optical planes to avoid double-counting of cells. 

Tastant delivery. Lingual stimulation and recording procedures were performed as 
previously described*"*. Tastants were delivered (5-12.5 ml per min) using a feeding 
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tube positioned approximately 8 mm inside the oral cavity, dorsal to the tongue. 
Tastants dissolved in artificial saliva were delivered in serial order, interspersed with 
pure artificial saliva. Recordings were performed during epochs of continuous irri- 
gation; we did not detect calcium activity in response to artificial saliva application, 
nor did ganglion neurons exhibit mechanical responses. The concentrations of tas- 
tants used were: acesulfame K, 30 mM; sucrose, 300 mM; cycloheximide 100-1000 LM; 
quinine, 5 mM; sodium chloride, 60 mM; citric acid, 50 mM; monopotassium glu- 
tamate + inosine monophosphate, 50 mM + 1 mM, respectively. Pharmacological 
compounds were delivered as previously described*’’: amiloride, 0.010 mM, 8 ml 
delivered over 1 min; and allyl isothiocyanate (AITC), 3 mM, 40 ml delivered over 
5 min followed by artificial saliva rinse for 1 min. 
Calcium imaging data analysis. Image processing. Two-photon images were first 
motion-corrected by using cross-correlation-based image alignment” (Turboreg, 
ImageJ plugin). Responding neurons were identified with a semi-automated script 
that uses independent components analysis (ICA) to examine the spatial and tem- 
poral skewness of pixels within active cells”*. After cell segmentation, all putative re- 
sponding cells were verified by visually examining the raw imaging data. Cells not 
detected by ICA analysis were manually segmented. For some cellular regions of in- 
terest, there was spatial overlap with a neighbouring cell; if this was minor, they were 
manually segmented, otherwise they were discarded from further analysis. 
Scoring of tastant-responsive cells. We used two independent methods to identify 
tastant-evoked transients in ganglion cells. First, we visually scored cells, by directly 
observing the aligned image data displayed asa relative fluorescence movie, as well 
as the putative cells’ fluorescence time series. As a control, we used an automated 
technique to identify transients in the same cells that were visually classified. The 
data shown in the manuscript used the manual method, although both were largely 
interchangeable (manual:automatic: sweet cells, 105:95; bitter cells, 168:170; sour 
cells, 152:152; salt cells, 204:204; umami, 32:26). Because bona fide calcium tran- 
sients may significantly bias the estimates of location by mean and of dispersion by 
standard deviation, we used robust statistical measures: the median and Q,, estim- 
ator” (median absolute deviation also provides similar results). The Q,, estimator 
was calculated for each trial to quantify baseline fluorescence levels. The distribu- 
tion of Q,, estimator values was used to estimate the typical relative fluorescence 
background levels of multiphoton microendoscopy in the geniculate ganglion to 
be 0.04 + 0.01. We visually examined the raw images in outlier trials where back- 
ground levels exceeded 10% AF/F (2.5 times Q,,). Such trials typically contained in- 
accurate image registration and were excluded from further analysis. We assumed 
transients to be outliers of an underlying normal distribution of calcium levels dur- 
ing a cell’s resting state. Thus, transients were operationally defined as continuous 
time intervals where the fluorescence intensities significantly deviated from base- 
line levels (for example, modified Z-score > 2) for at least one fluorescence decay 
constant (for example, 650 ms for GCaMP3). In both manual and automated cases, 
we defined transients as tastant-evoked if their onsets occurred between the start of 
tastant delivery and the two seconds after the end of tastant delivery. Transient on- 
sets varied across animals and are believed to be due to factors associated with 
tastant arrival: placement of the tube within the oral cavity, tastant flow rate, and 
location of TRCs relative to the flow. Cells were classified as tastant-responsive if 
they responded in at least 50% of presentations for a given tastant. Finally, these 
cells were grouped by their responses to the five basic qualities (any of 31 potential 
categories, see text). Any category in which the number of contained cells was less 
than 1% of the total population was excluded from analysis (Fig. 4). 
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Extended Data Figure 1 | Thy1-GCaMP3 mice show normal physiological (m = 3; Student’s t-test; NaCl, P = 0.85; bitter, P = 0.46; sour, P = 0.94; 
responses to tastants. a, Representative nerve recording traces from control sweet, P = 0.69; umami, P = 0.77). Recordings were normalized to responses 
and Thy1-GCaMP3 mice in response to various tastants (see Methods for to KCl (500 mM). Horizontal bars below the traces mark the time and duration 
details). b, Quantification of neural responses (mean + s.e.m.) show that of the stimulus. 

Thy1-GCaMP3 mice (n = 4) are indistinguishable from wild-type mice 


Wildtype 
Thy1-GCaMP3 


100 


Normalized response (%) 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Sweet Bitter 


b 

<< 80 S 80 

a a 

3) 3) 

oO oO 

‘5 40 ‘o 40 

® ® 

2 2 

§ § 

Zz 0 Zz O 

43 2 1 432 1 
Sweet responses Bitter responses 
c 
ir 

re) 30% 
i) 
Zz 


o 

a) 

S 

no 

s 

ao 
Extended Data Figure 2 | Reproducibility of tastant-evoked responses in cells respond all four times, three out of four, two out of four, and one 
geniculate ganglion neurons. a, Representative images of calcium-evoked out of four. c, Sample traces of four representative neurons challenged with 


GCaMP3 activity in response to sweet (left) and bitter (right) stimulation. Four _ 50 trials of the same tastant over a time window of 10 min. Note the high 
relative fluorescence images are shown from separate trials. In each trial, the _reliability in the activation of the neurons. This experiment also illustrates the 
identical cell populations were activated. b, We tested 105 sweet responding desensitization of bitter neurons (bottom traces) over time. Horizontal bars 
cells and 168 bitter responding cells for their reproducibility in our automated _ below the traces mark the time and duration of the stimulus. 

scoring algorithm for four trials. The histograms show the number of times the 
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Extended Data Figure 3 | Quantification of taste ganglion responses. trial amplitudes. e, Quantification of mean response amplitudes in singly tuned 
a-d, Rank-ordered plot of calcium transient amplitudes for various singly salt ganglion neurons before and after amiloride treatment (10 UM, n = 23 
tuned ganglion neurons (see text and Fig. 3). For each cell, the mean response cells; paired t-test, P< 0.001). f, Quantification of mean response amplitudes 
amplitudes for preferred stimulus (red) and the mean amplitude of its next- in singly tuned bitter cells before and after AITC treatment’ (3 mM, n = 63 
strongest tastant response (grey) are shown; minor dots indicate individual cells; paired t-test, P< 0.001). 
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Extended Data Figure 4 | Bitter-sour ganglion cells receive taste 
information from bitter T2R-expressing cells. Distribution of bitter, sour, 
and bitter-sour ganglion cells in a sample of control animals (n = 4) and in 
animals expressing tetanus toxin in PKD2L1-expressing TRCs (PKD2L1- 
TeNT; n = 3). As expected, no cells responsive to citric acid (50 mM) are 
detected in PKD2L1-TeNT mice’”. However, bitter-sour cells are unaffected 
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(see Fig. 4), suggesting that activation of T2R-expressing TRCs mediates these 
acidic responses. As predicted, subsequent application of the bitter TRC 
inactivator AITC”’ abolishes bitter responses of the bitter ganglion neurons, 
as well as the bitter and sour responses in the bitter-sour cells. Note that 

the solid bars showing less than 1 cell are used to illustrate the lack of 


responding cells. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


a 


a 


Sess, 


| a 


b 
AAV GCaMP6s Thy1 GCaMP3 


preference in this field. Sour, yellow; sweet, magenta; NaCl, green; bitter, cyan. 
Scale bar, 200 jum. b, Representative fields of the geniculate ganglion from 

6 different mice. The random distribution of neurons representing the various 
taste qualities is independent of sensor, or method of sensor delivery/expression 


(AAV-GCaMP6 or Thy1-GCaMP3); colour scheme same as for a. 


Extended Data Figure 5 | Representation of taste quality does not cluster 
within the geniculate ganglion. a, Two-photon endoscopic image (left) of a 
geniculate ganglion expressing GCaMP3. Highlighted are the locations of the 
facial (VII) and greater superficial petrosal (GSP) cranial nerves. The right 
panel shows approximately 50 neurons colour-coded according to their taste 
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Extended Data Figure 6 | Representation of taste mixtures. a, Imaging fields 
of three representative geniculate ganglia illustrating the ensembles of neurons 
recruited by two different single taste stimuli presented separately (left panels) 
versus the ensemble of neurons activated by a mixture of the two compounds 
presented together (right panels). See text for details; as expected there are 
no mixture-specific responders, and very few cells responded to each tastant in 
the mix: only 3 out of 113 cells examined with bitter + sweet responded to both 
tastants, 5 out of 301 cells examined with sour + salty responded to both, 
and 0 of 39 examined with salty + sweet responded to both tastants. We 
note that sour stimuli are known to suppress sweet responses”’, but such 
suppression is sweet-cell autonomous and not due to interactions between 
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sweet and sour TRCs (data not shown). b, To quantitatively examine the impact 
of taste mixes on the responses of individual ganglion neurons, we analysed 
their response amplitudes in the presence of the single tastant versus the binary 
mix. Shown are plots of response amplitudes of a representative set of bitter, 
sweet, salty and sour geniculate neurons stimulated with their selective tastant 
(x axis) versus their response amplitude when in the presence of an additional 
tastant (as indicated in the y axis; shown are average AF/F over 4 trials). 95% 
confidence intervals were determined using a ratio f-test: bitter + sweet/bitter, 
0.73-0.91; sweet + bitter/sweet, 1.15-1.34; NaCl + sour/NaCl 0.74-1.00; 

sour + NaCl/sour, 0.95-1.16. 
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Extended Data Table 1 | Shown are the numbers of responding neurons analysed in each of the 14 AAV-GCaMP6s and 15 Thy1-GCaMP3 
mice 


AAV-GCaMP6s 
Animal Responding Cells Animal Responding Cells Animal Responding Cells 
1 70 2 57 3 51 
4 28 5 25 6 25 
7 24 8 21 9 19 
10 17 11 12 12 10 
13 10 14 9 
Thy1-GCaMP3 
Animal Responding Cells Animal Responding Cells Animal Responding Cells 
1 91 2 66 3 31 
4 30 5 26 6 26 
7 24 8 24 9 22 
10 21 11 21 12 18 
13 15 14 8 15 8 
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Extended Data Table 2 | The distribution of 971 taste ganglion neu- 
rons according to their responses to each of the five basic taste qualities 


Response profile Number of Cells 
NaCl 204 
Bitter 168 
Sour 152 
Sweet 105 
Umami 32 
Bitter - sour 125 
Sweet - umami 32 
Sweet - NaCl 32 
Sweet - sour 17 
Sweet - bitter 14 
Bitter - NaCl 12 
NaCl - sour 11 
NaCl - umami 8 
Sweet - bitter - sour 8 
Bitter - NaCl - sour r 
Sweet - NaCl - umami 6 
Bitter - sour - umami 6 
Sweet - sour - umami 5 
Sweet - bitter - sour - umami 5 
Bitter - umami 4 
Sweet - bitter - umami 3 
Sweet - NaCl - sour 3 
Bitter - NaCl - sour - umami 3 
NaCl - sour - umami 2 
Sweet - bitter - NaCl 2 
Sweet - bitter - NaCl - sour 2 
Sweet - NaCl - sour - umami 2 
Bitter - NaCl - umami 1 


Allthe data showing responses in at least 1% of the total population (above horizontal line) are included in 
Fig. 4 (those below 1% were found on average in less than 1 in 4 animals). Note that bitter-sour-tuned 
neurons reflect the activation of T2R-expressing TRCs (see text and Methods for details). 
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Control of plant stem cell function by conserved 
interacting transcriptional regulators 


Yun Zhou', Xing Liu, Eric M. Engstrom’+, Zachary L. Nimchuk?}, Jose L. Pruneda-Paz*, Paul T. Tarr', An Yan', Steve A. Kay® 


& Elliot M. Meyerowitz’? 


Plant stem cells in the shoot apical meristem (SAM) and root apical 
meristem are necessary for postembryonic development of above- 
ground tissues and roots, respectively, while secondary vascular stem 
cells sustain vascular development’ *. WUSCHEL (WUS), a homeo- 
domain transcription factor expressed in the rib meristem of the 
Arabidopsis SAM, is a key regulatory factor controlling SAM stem 
cell populations”*, and is thought to establish the shoot stem cell 
niche through a feedback circuit involving the CLAVATA3 (CLV3) 
peptide signalling pathway’. WUSCHEL-RELATED HOMEOBOX 
5 (WOXS), which is specifically expressed in the root quiescent cen- 
tre, defines quiescent centre identity and functions interchangeably 
with WUS in the control of shoot and root stem cell niches*. WOX4, 
expressed in Arabidopsis procambial cells, defines the vascular stem 
cell niche’"". WUS/WOxX family proteins are evolutionarily and func- 
tionally conserved throughout the plant kingdom” and emerge as 
key actors in the specification and maintenance of stem cells within 
all meristems’. However, the nature of the genetic regime in stem 
cell niches that centre on WOX gene function has been elusive, and 
molecular links underlying conserved WUS/W OX function in stem 
cell niches remain unknown. Here we demonstrate that the Arabidopsis 
HAIRY MERISTEM (HAM) family of transcription regulators act 
as conserved interacting cofactors with WUS/WOX proteins. HAM 
and WUS share common targets in vivo and their physical interac- 
tion is important in driving downstream transcriptional programs 
and in promoting shoot stem cell proliferation. Differences in the 
overlapping expression patterns of WOX and HAM family members 
underlie the formation of diverse stem cell niche locations, and the 
HAM family is essential for all of these stem cell niches. These find- 
ings establish a new framework for the control of stem cell produc- 
tion during plant development. 

To identify the molecular mechanism underlying WUS functions in 
stem cells, we screened for WUS-interacting transcription cofactors using 
yeast-two-hybrid assays with a transcription factor library’, and found 
that HAIRY MERISTEM 1 (HAM1) strongly and specifically interacts 
with WUS (Fig. 1a). HAM genes, encoding GRAS domain transcrip- 
tion regulators, contribute to shoot stem cell function in Petunia and 
Arabidopsis'*’. Four HAM genes (HAM1-HAM4) have been identified 
in Arabidopsis’*, and further yeast assays revealed that WUS also inter- 
acted with three other HAM family members (Extended Data Fig. 1a). 
WUS-HAM associations were confirmed by bimolecular fluorescence 
complementation (BiFC) assays in tobacco (Nicotiana benthamiana), 
in which WUS and HAM were fused to the amino- and carboxy-terminal 
halves of green fluorescent protein (GFP), respectively (GFPn and GFPc). 
Strong GFP fluorescence in nuclei was observed when GFPn-WUS was 
co-transformed with GFPc-HAM (Fig. 1b, cand Extended Data Fig. 1b-e). 
WOX4 and WOXS also interacted with HAM proteins in BiFC assays 
(Fig. 1d and Extended Data Fig. 1f-q). These WOX-HAM interac- 
tions were further confirmed through in vitro pull-down assays, where 


glutathione S-transferase (GST)-WOX¢4 but not GST bound HAM4- 
His., and GST-WUS but not GST bound HAM 1-Hisg (Fig. le). Inter- 
actions in planta were then tested using co-immunoprecipitation assays 
in tobacco, in which WUS-GFP bound Flag-HAM1 (Fig. 1f) and Flag- 
HAM2 (Fig. 1g), GFP-WOX4 bound Flag~-HAM4 (Fig. 1h), and WOX5- 
GFP bound Flag—HAM2 (Fig. 1i). In short, with multiple approaches, 
our work revealed physical interactions between HAM and WUS/WOX 
family members. 

We next constructed various deleted derivatives of HAM1 and WUS 
for yeast two-hybrid assays to identify essential regions for their inter- 
actions. Deleting amino acids from 117 to 230 (D117-230) in HAM1 
abolished the interaction (Extended Data Fig. 2a). This amino-terminal 
fragment is important for HAM] function in stem cell maintenance, as 
HAM1(D117-230) did not complement the ham1;2;4 early termina- 
tion phenotype, whereas full-length HAM1 driven by the same HAM1 
promoter did (Extended Data Fig. 2b-g), and it is conserved in HAM 
proteins from Arabidopsis and across different plant species (Extended 
Data Fig. 2h-j). Deletion analyses of WUS identified a carboxy-terminal 
region required for interaction with HAMI1 (Extended Data Fig. 3a), 
which is also required for WUS function (Extended Data Fig. 3b-d) and 
is conserved in different plant species (Extended Data Fig. 3e). 

To dissect the roles of the HAM-WUS interaction in controlling shoot 
stem cell niches, genetic interactions were analysed between ham1;2;3 
(lacking the function of three of four HAM genes) and the weak wus 
allele wus-7 (missense mutant), which forms a functional shoot apex’® 
similar to wild type in terms of vegetative and inflorescence meristems 
(Fig. 2a, b, e). Different from wus-7 single mutants (Fig. 2b) or ham1;2;3 
triple mutants (Fig. 2c), wus-7;ham1;2;3 quadruple mutants display early 
termination of vegetative meristem development (Fig. 2d), thus resem- 
bling wus complete loss of function (null) mutants’. This effect also 
occurred in wus-7/wus-7;ham 1/ham1;ham2/ham2;ham3/+ plants, in 
which 41 out of 45 plants showed strong termination of inflorescence 
and floral meristems, with only leaves (Fig. 2h) or barren pedicels (flowers 
without carpels) (Fig. 2g) left at the top of the main shoot, a phenotype 
typical of wus-1 null mutants’, but never observed in wus-7 (Fig. 2e) or 
ham1/ham1;ham2/ham2;ham3/+ (Fig. 2f) plants. Secondary inflores- 
cence meristems initiated from axillary meristems in wus-7/wus-7;ham1/ 
ham1;ham2/ham2;ham3/-+ plants also terminated prematurely (Extended 
Data Fig. 4a, b). Additionally, three out of four wus-7/wus-7;ham1/ham1; 
ham2/ham2;ham4/+ plants displayed inflorescence meristem termi- 
nation and lacked carpels (Extended Data Fig. 4c). A dose-dependent 
enhancement of stem cell termination was evident in wus-7;ham1/+; 
ham2/+;ham3/+ and wus-7; ham1/+;ham2/ham2;ham3/ham3 back- 
grounds (Extended Data Fig. 4d—f), demonstrating a functional interdepen- 
dence between WUS and HAM family members in vivo. Downregulation 
of HAM1, HAM2 and HAM3 ina ham4 shoot meristem, through acti- 
vation of the microRNA MIR171—reported to target the HAM1, HAM2 
and HAM3 genes'°—led to terminated vegetative development (Extended 
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Figure 1 | WUS/WOX and HAM family proteins physically interact. a, LacZ 
activity in yeast two-hybrid assays. AD, activation domain; DBD, DNA- 
binding domain. Error bars show mean ~ standard error of the mean (s.e.m.) 
(n = 3 biological replicates). ***P < 0.001 (two-tailed t-test). b-d, BiFC in 
tobacco. Panels (left to right): GFP; propidium iodide (PI) staining; merged 
channels. Scale bars, 20 jum. e, SDS—polyacrylamide gel electrophoresis 
(SDS-PAGE) of input recombinant proteins stained by Coomassie blue (left), 
and pull-down of His,-tagged HAM proteins through GST-tagged WUS/WOX 
proteins detected by immunoblotting with anti-His antibody (right). 

Asterisk indicates HAM1-His, band and numbers indicate the apparent 
molecular weight of the protein bands in the protein standard. f-j, Co- 
immunoprecipitation of WUS-GFP and Flag-HAM1 (f), WUS-GFP and 
Flag-HAM2 (g), GFP-WOX¢4 and Flag-HAM4 (h), WOX5-GFP and 
Flag-HAM2 (i) (see Methods). IB, immunoblot; IP, immunoprecipitation. 


IPSGEP. IP: GFP 


Data Fig. 4g, h) similar to the wus-1 phenotype, suggesting that WUS 
alone is not sufficient to maintain SAMs in the absence of HAM activity. 
Finally, the wus-1;ham1;2;3 quadruple homozygote resembles a wus-1 
single mutant in several aspects including the vegetative meristem (Ex- 
tended Data Fig. 4i-1), suggesting that WUS and HAM genes could act 
together at the SAM. All these genetic data are consistent with the 
hypothesis that WUS and HAM function as partners in shoot meri- 
stem maintenance. 

In addition to genetic interactions, the molecular function of the WUS- 
HAM interaction was further investigated. First, quantitative PCR with 
reverse transcription (RT-PCR) results (Fig. 3i) demonstrated that HAM 
proteins regulate expression of a set of genes including JAZ5, TIP2;2, 
TCP9, GRP23 and TPL, which were reported to be directly regulated by 
WUS”. These WUS downstream targets were misregulated in wus-7 or 
ham1;2;3 triple mutants in similar manners, and wus-7 and ham1;2;3 
synergistically regulated their expression (Fig. 2i), consistent with func- 
tional physical (Fig. 1) and genetic (Fig. 2a—-h) interactions between WUS 
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Figure 2 | WUS and HAM family genes cooperatively control the shoot 
stem cell niche and co-regulate a common gene set. a—h, Shoot apices 
(a-d) (arrows) and inflorescence structures (e-h) of plants of indicated 
genotypes (Ler, wild type). Scale bars, 2 mm. i, RT-PCR quantification of WUS 
and HAM target gene expression in indicated genotypes. Error bars 

show mean + s.e.m. (n = 3 biological replicates). j-m, Ratio of firefly 
luciferease (LUC) to Renilla luciferase (REN) activity in tobacco cells 
co-transformed with different reporter constructs (structure above each graph) 
and indicated effectors (see Methods). Min35S, 60-base-pair 35S minimum 
element; LB, transfer DNA (T-DNA) left border; RB, T-DNA right border. 
Error bars show mean + s.e.m. (1 = 3 biological replicates). n, 0, ChIP of 
HAM? protein with TPL or GRP23 chromatin regions, with amplicon locations 
(bars with numbers) shown above each graph. The ChIP experiments were 
repeated three times using independent biological replicates with similar 
results, and one representative data set is shown. i-o, *P < 0.05, **P < 0.01, 
*** P< 0.001 (two-tailed t-test). 


and HAM. Second, dual luciferase assays were conducted in planta to 
confirm the direct effects of WUS-HAM on target gene expression. 
Compared with empty-vector controls, the target genes examined were 
moderately (Fig. 2j-k) or barely (Fig. 21, m) regulated by WUS or HAM 
alone, but were markedly affected when WUS and HAM were combined 
(Fig. 2j-l), indicating a role for the WUS-HAM interaction in regulat- 
ing their transcription activities. Last, chromatin immunoprecipitation 
(ChIP) experiments demonstrated an in vivo association of yellow fluo- 
rescent protein (YFP)-HAM2 proteins with TPL (Fig. 2n) and GRP23 
promoters (Fig. 20), genomic regions similar to those reported to asso- 
ciate with WUS protein in vivo”, supporting the notion that HAM family 
members are functional WUS cofactors in controlling the shoot stem 
cell niche through regulation of common target genes. 

Consistently with physical and genetic interactions between HAM 
and WOX members, visualization of HAM and WUS/WOxX fluorescent 
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Figure 3 | HAM and WUS/WOX expression domains overlap. 

a-d, Expression of pHAM1::2* YPET-N7MICRORNASENSITIVE marker 
(pHAM1::2X YPET-N7mirS) (green) (a), pHAM2::2X YPET-N7mirS (green) 
(b), pWUS::DsRed-N7 (red) (c) in Ler inflorescence meristem, and 
pHAMI1::2X YPET-N7mirS (green) marker in a clv3-2 inflorescence meristem 
(d). Orthogonal (top) and transverse section (bottom) views of the same plant 
are shown. e-l, Overlapping expression patterns of pWUS::DsRed-N7 with 
PHAMI1::2X YPET-N7mirS or pHAM2::2X YPET-N7mirS in the same shoot 
meristems (see Methods). Panels (from left to right): dsRed (red); YPET, an 
improved version of YFP (green); PI (grey); merged channels. m, n, Expression 
of pHAM2::YPET-HAM2? translational marker (green) in L1 (m) and L3 (n) 
of the same ham1;2;4 SAM. o-t, Overlapping expression patterns of 
pHAM4::2X YPET-N7 and pWOX4::YFP (green, arrows) in the provascular 
and procambium cells in cotyledons (0, p), seedlings (q, r), and stem 
transverse sections (s, t). PI counterstain: red (a, b, d, m, n); green (c); 

grey (g, h, k, 1). Chlorophyll autofluorescence: red (0-t). Scale bars: 50 jum 
(d, s, t); 200 tum (0); 100 jim (p-r); 20 [um (a-c, e-n). 


transcriptional reporters revealed that WOX and HAM family expres- 
sion overlapped in planta. In vegetative (Extended Data Fig. 5c—h) and 
inflorescence meristems (Fig. 3a—c), HAM1 and HAM2 expression over- 
lapped with that of WUS in the rib meristem. HAM1 is expressed in the 
rib meristem and peripheral zone but not in the L1 or L2 layers of the 
central zone (Fig. 3a and Supplementary Video 1), while HAM2 expres- 
sion peaks within the centre of the rib meristem (Fig. 3b). Similarly to 
WUS (Extended Data Fig. 5a, b), HAM1 is negatively controlled by CLV 
signalling, as HAM1 is expressed throughout clv3-2 meristems (Fig. 3d). 
Weimaged the WUS and HAM1 or HAM2 reporters in the same SAMs 
(Fig. 3e-I). Although expressed broadly, signals from HAM1 (Fig. 3f) 
or HAM2 (Fig. 3j) overlap with WUS signals (Fig. 3e, i) in the same rib 
zone cells (Fig. 3h, 1 and Extended Data Fig. 5i-p). As the WUS protein 
has been reported to move in the SAM from its site of transcription in 
the rib domain”', the WUS and HAM1/HAM2 interaction domain in 
SAMs could be broader than their transcriptional domain overlap. We 
also examined the HAM2 translational reporter pHAM2::YPET-HAM2 
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in the ham1;2;4 SAM (Fig. 3m, n and Extended Data Fig. 6), which 
completely complements the ham1;2;4 triple mutant (Extended Data 
Fig. 6a—c), and it showed a pattern similar to the HAM2 transcriptional 
reporter: a strong signal in the centre starting from L3 and low or no 
signal in the L1 layer (Fig. 3m, n and Extended Data Fig. 6d, e). Taken 
together, the co-localization of WUS and HAM1/HAM2 in SAMs is 
consistent with functional WUS-HAM1/HAM2 interactions (Figs 1 
and 2). 

HAM4 and WOX4 are co-expressed in the provascular or procam- 
bial cell types of various tissues (Fig. 30-t and Extended Data Fig. 7). In 
stem transverse sections, HAM¢4is expressed specifically in the procam- 
bium, overlapping with WOX4 expression, as well as with the HAM3 
and HAM1 expression domains (Fig. 3s, tand Extended Data Fig. 7j-I). 
The tightly co-regulated spatial and temporal HAM4 and WOX4 expres- 
sion patterns are consistent with a WOX4-HAM.4 interaction module 
(Fig. 1h). Both HAM2 transcriptional and translational reporters (Ex- 
tended Data Fig. 8) are expressed in root meristem cells including the 
quiescent centre, overlapping with the quiescent-centre-specific WOX5 
expression domain®, consistent with previous reports from cell-type- 
specific transcriptome analyses” and indicating the possibility of 
WOX5-HAM2 interactions in roots. Our finding that both WUS and 
WOXkXsS interact with HAM2 may be partially accounted for by the fact 
that WUS and WOXS are interchangeable in controlling SAMs and 
root apical meristems*. Taken together, distinct and overlapping expres- 
sion patterns of HAM and WOX members indicate that specific HAM- 
WOX pairs function within different stem cell niches throughout the plant. 

To address the importance of the entire HAM family in the control 
of stem cell niches, we generated a ham1;2;3;4 quadruple homozygous 
mutant. Compared with wild type, ham1;2;3;4 plants displayed growth 
arrest at the early seedling stage, containing short roots and terminated 
shoots with two small leaf-like structures 26 days after germination (DAG) 
(Fig. 4a—c and Extended Data Fig. 9a—d); the shoot apices exhibited valley- 
like shapes at 26 DAG, lacking functional meristems (Fig. 4d); the 
hypocotyl transverse sections showed clear vascular defects, and the 
vascular bundles had reduced numbers of xylem vessels, fibres (dark- 
blue-stained) and phloem cells (red-stained), consistent with a reduc- 
tion in the stem cell activity necessary for generating these cell types 
(Fig. 4e, f). Moreover, mid-veins in ham 1;2;3;4 leaf-like tissues did not 
differentiate but instead accumulated a dark-staining cell mass, resem- 
bling ground tissue cells (Extended Data Fig. 9e, f). This is similar to, but 
much stronger than, the reported WOX4 RNA interference phenotype”. 
Root meristematic activity is also severely compromised in ham mul- 
tiple mutants. The quiescent centre and columella stem cells (CSCs) in 
ham1;2;3;4 mutants displayed enlarged and irregular shapes (Extended 
Data Fig. 9g, h) and, with incomplete penetrance, the CSCs in ham1;2;3 
mutants differentiate (Extended Data Fig. 9i-l), resembling reported 
defects in wox5 mutants*. However, the root phenotype of ham1;2;3 or 


Figure 4 | HAM family members are essential for various plant stem cell 
activities. Scanning electron microscopic imaging of wild-type (a) and 
ham1;2;3;4 (b-d) seedlings (26 DAG). Arrow indicates a ham1;2;3;4 plant 
lacking a functional SAM. e, f, Transverse sections of wild-type and ham1;2;3;4 
hypocotyls (7 DAG). Scale bars: 1 mm (a-c, e, f); 50 um (d). 
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ham1;2;3;4 plants is much more severe than that of the wox5 mutant, sug- 
gesting that HAM regulates root meristem development not only through 
direct interaction with WOXS but also through WOX5-independent 
pathways. In summary, in diverse meristems, ham 1;2;3;4 mutants dis- 
play defects that share similarities with mutants lacking WOX activ- 
ities, supporting the idea that HAM proteins are cofactors for WUS/ 
WOX-family-mediated stem cell niche maintenance. Given the evolu- 
tionary conservation of plant meristem cell niches and the WOX/HAM 
gene families’*”*, and the fact that WOX-HAM interactions exist in 
flowering plants besides Arabidopsis (Extended Data Fig. 10), this work 
establishes a new basis for studying stem cell niches in Arabidopsis, and 
provides a paradigm for meristem cell control regimes likely to be uni- 
versal in flowering plants. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Plant materials and growth conditions. Arabidopsis thaliana plants were grown 
in a sunshine soil/vermiculite/perlite mixture under continuous light at 20 °C. The 
mutant lines ham1;2;3 (triply homozygous for mutant alleles of ham1-1, ham2-1 
and ham3-1), ham1;2;4 (triply homozygous for mutant alleles of ham1-1, ham2-1 
and ham4-1), wus-7, wus-1, clv3-2 were previously described*"®'***, wus-7;ham1;2;3, 
wus-7;ham1;2;4, wus-1;ham1;2;3, and ham1;2;3;4 mutants were generated through 
genetic crosses, and identified based on PCR genotyping in the F2 segregating 
population. Different mutant combinations in an er background were chosen for 
genetic and morphological analyses. All of the phenotypes were confirmed from 
multiple independent segregation lines to control for differences in ecotype back- 
ground. PCR genotyping was performed as previously described’*"*. Reporter lines 
for p WUS::DsRed-N7 and pWOX4::YFP were previously reported'!”. 

Yeast two-hybrid assay. Yeast transformation and f-galactosidase assays were 
performed following the manufacturer’s instructions (Clontech). Full-length cDNAs 
for WUS, HAM1, HAM2, HAM3 and HAM4 were cloned into pENTR/D/TOPO 
or pCR8 (Invitrogen), and then WUS cDNA was Gateway cloned to pDEST32, and 
HAM1, HAM2, HAM3 and HAM4 cDNAs were Gateway cloned into pDEST22 
using standard LR reactions (Invitrogen). All of the deletion derivatives for WUS 
or HAMI were generated through overlapping PCR with the primers listed later, 
cloned into pENTR/D-TOPO or pCR8, and cloned into pDEST32 or pDEST22 
through LR recombination (Invitrogen). All clones were sequenced to confirm that 
they were in-frame and with designed deletions before being transformed into 
yeast. The bait and prey vectors were transformed into yeast strain MaV203, and 
three single transformed colonies per genotype were used as triplicate for the LacZ 
liquid assay in 96 Deepwell plates (Thermo) and optical density (OD) readings 
were recorded in a 96-well plate reader (Tecan). LacZ activity was calculated as 
(OD.429 nm X 1,000)/(ODg¢o0 nm X cell volume in pl X assay time in minutes) fol- 
lowing the yeast two-hybrid handbook (Clontech), including a standard error from 
three biological replicates. 

BiFC. For BiFC experiments, full-length Arabidopsis WUS, WOX4, WOX5, HAM1, 
HAM2, HAM3, HAM4, BARD1 and FAMA cDNA Gateway clones were recom- 
bined into vectors containing each half of GFP (N or C terminus) to generate the 
fusion proteins (GFPn—WUS, GFPn-WOX4, GEPn-WOXS, GFPn-BARD1, GFPn- 
FAMA, GFPc-HAM1, GFPc-HAM2, GFPc-HAM3, GFPc-HAM4, GFPc-BARD1, 
GFPc-FAMA) as previously described”*. Plasmid pairs for testing the specific inter- 
actions (such as GFPn-WUS and GFPc-HAM1) were co-transformed together with 
the P19 silencing suppressor”’ into N. benthamiana leaves through Agrobacterium 
infiltration. The infiltrated tobacco leaves were stained with PI and imaged using a 
Zeiss LSM 510 Meta confocal microscope two days after infiltration. Green GFP 
signals in nuclei (which demonstrate the physical interaction) and red PI staining 
signals (which indicate tobacco cell structure) were captured at the same time from 
different detection channels. A 488 nm laser line was used to stimulate GFP and PI. 
A 505-530 bandpass filter was used to collect GFP signal and a 585-615 bandpass 
filter was used to collect PI signal. BARD1, a nuclear-localized protein, was included 
as a negative control. FAMA, a bHLH transcription factor that has been demon- 
strated to interact with bHLH transcription factors”, was used as an additional 
negative control. The positive signals for each pair were confirmed with four inde- 
pendent biological replicates, and representative images are shown in the figures. 
The same method was also used for tomato (Solanum lycopersicum) proteins, includ- 
ing GFPn-tomato WUS, GFPn-tomato WOX4 and GFPc-tomato HAM. 
Co-immunoprecipitation and western blot analysis. WUS or WOX5 cDNA in 
pCR8 was recombined to pMDC83 (ref. 29) to generate a WUS-GFP or WOX5- 
GFP fusion clone. Flag~-HAM1, Flag~-HAM2 and Flag~-HAM4 were PCR amp- 
lified with primers 5’-CACCATGgactacaaggacgacgatgacaaggecgstgoaagtCCCTT 
ATCCTTTGAAAGGTTTCAAGG -3’, 5’-CTAACATTTCCAAGCAGAGACA 
GTAACAAGTTC-3’, and with primers 5’-CACCATGgactacaaggacgacgatgacaag 
gecestggaagtCCCCTGCCCITTGAGCAATTT-3’, 5/-TTAACATTTCCAAGCT 
GAGACAGTA-3’, and with primers 5’-CACCATGgactacaaggacgacgatgacaagggc 
getegaagt AAA ATCCCTGCATCATCTCCTC:3’, 5’-CTAAAACCGCCAAGCTG 
ATGTGGCAACAAG-3’, respectively (lower-case letters indicate coding sequences 
for Flag and a linker). GFP DNA was amplified and sub-cloned in front of WOX4 
cDNA in-frame to generate a GFP-WOX¢4 fragment. Flag~-HAM1, Flag-HAM2, 
Flag-HAM4 and GFP-WOX4 were then recombined into pMDC32 (ref. 29). For 
co-immunoprecipitation of WUS-GFP with Flag-HAM1, WUS-GFP with Flag- 
HAM2, GFP-WOX4 with Flag-HAM4, or WOX5-GFP with Flag-HAM2 in N. 
benthamiana, the constructs were introduced into N. benthamiana leaves through 
Agrobacterium infiltration. The leaves were harvested 2 days after infiltration and 
frozen in liquid nitrogen. For the immunoprecipitation of YFP-HAM2 in Arabidopsis, 
the shoot apices from the transgenic plants pHAM2::YFP-HAM2 in ham1;2;4 were 
harvested. The nuclei from Arabidopsis or tobacco were isolated, and then lysed 
with RIPA buffer (50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 1% NP-40, 0.5% sodium 
deoxycholate, 0.1% SDS, 3 mM dithiothreitol (DTT), 2 mM NaF and 1 mM NaVOs, 
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or 50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1% NP-40, 0.5% sodium deoxycholate, 
0.1% SDS for co-immunoprecipitation of GFP-WOX4 with Flag~-HAM4) con- 
taining protease inhibitor cocktail (Roche) and 200 uM PMSF by incubation on ice 
for 30 min followed by brief sonication. Clear lysates were mixed with diluting buffer 
containing PMSF and protease inhibitor cocktail (50 mM Tris-HCl, pH 7.5, 150 mM 
NaCl, 3mM DTT, 2mM NaF and 1mM NaVO3, or 50mM Tris-HCl, pH 8.0, 
60 mM NaCl for co-immunoprecipitation of GFP-WOX¢4 with Flag-HAMA) (1:5, 
v:v), immunoprecipitated with GFP-Trap agarose beads (ChromoTek), and the 
beads were washed three times with the diluting buffer in spin columns (BioRad). 
The recovered proteins were eluted from the beads by boiling in 2X SDS sample 
buffer, separated by SDS-PAGE and transferred to nitrocellulose membrane 
(Millipore). Proteins were detected using anti-GFP antibody (Roche, catalogue 
#11814460001), anti-Flag antibody (Sigma, catalogue #F1804), and horseradish per- 
oxidase (HRP)-conjugated anti-mouse antibody (Promega, catalogue #W4021). The 
co-immunoprecipitation experiments were repeated twice with similar results. 
Protein expression constructs and protein purification. WOX4 cDNA was ampli- 
fied with primers 5’-CATAGAATTCATGAAGGTTCATGAGTTTTCGAA-3' 
and 5'-AGTTGCGGCCGCTCATCTCCCTTCAGGATGGAGAGGA-3’ (restric- 
tion enzyme sites are in bold), and cloned in-frame in pGEX-4T-1 with EcoRI and 
NotI sites. WOX5 cDNA was amplified with primers 5'-ATTTCCCGGGTATGT 
CTTTCTCCGTGAAAGGTCG-3’ and 5'-AGTTGCGGCCGCTTAAAGAAAG 
CTTAATCGAAGATCT-3’ (restriction enzyme sites are in bold), and cloned in- 
frame in pGEX-4T-1 with Xmal and Not! sites. WUS cDNA was amplified with 
primers 5'-CATAGAATTCATGGAGCCGCCACAGCATCAG-3’ and 5'-AGT 
TGCGGCCGCCTAGTTCAGACGTAGCTCAAGA-3’ (restriction enzyme sites 
are in bold), and cloned in-frame in pGEX-4T- 1 with EcoRI and NotI sites. HAM1- 
His, tag was generated from PCR with primers 5’-CATAGAATTCATGCCCT 
TATCCTTTGAAAGGTTTCAAGG-3’ and 5'-AGTTGCGGCCGCCTAGTGA 
TGATGATGATGATGACATTTCCAAGCAGAGACAGTAACAAGTTCTT-3’ 
(restriction enzyme sites are in bold, and Hiss coding sequence is underlined), and 
cloned in-frame with thrombin cutting site in pGEX-4T-1 with EcoRI and Not. 
HAM4-Hibsg tag was generated from PCR with primers 5'-CATAGAATTCATG 
AAAATCCCTGCATCATCTCCTC -3’ and 5’-AGTTGCGGCCGCCTAGTGA 
TGATGATGATGATGAAACCGCCAAGCTGATGTGGCAACAAG-3’ (restric- 
tion enzyme sites are in bold, and Hisg coding sequence is underlined), and cloned 
in-frame with thrombin cutting site in pGEX-4T-1 with EcoRI and NotI. All pro- 
teins were expressed in Rosetta Escherichia coli (Novagen) by inducing with 0.4 mM 
isopropyl--p-thiogalactoside (IPTG) at 16 °C for 2-4h. GST-WOX4, GST-WOX5 
and GST were purified on glutathione resin. HAM1 was purified on glutathione 
resin followed by digestion with thrombin and chromatography on $200 resins as 
described previously**?. HAM4 was purified on glutathione resin followed by 
digestion with thrombin and removal of the GST associated with glutathione resin 
as described previously’. 

In vitro pull-down assay. GST-WOX4, GST-WOX5, GST-WUS or GST were 
immobilized on glutathione resin and incubated with HAM1-His, or HAM4-His, 
for 30 min at 4°C. The glutathione resin was then washed three times and pro- 
cessed for SDS-PAGE analysis and western blot analysis using antibody to His-tag 
(Qiagen, catalogue #34660). The pull-down experiments were repeated twice with 
similar results. 

Transactivation assay in tobacco. A 60 base pair (bp) minimal 35S fragment (the 
—60 minimal promoter) was amplified and cloned with BamHI/Ncol sites into the 
pGREEN800II LUC” to generate a pGREEN800II-60LUC. TPL promoter was PCR 
amplified from Col-0 genomic DNA with primers 5'-AACAGGTACCGAACGC 
TTCGTTTCATTAGTTTATC-3’ and 5'-AATAGGATCCGTTTTCTCTCACT 
TCCTTAAAAGACT-3’ (restriction enzyme sites are in bold) and cloned with 
KpnI/BamHI sites into the pGREEN800II-60LUC. TIP2;2 promoter was amplified 
with primers 5’-AACAGGTACCCGAGTGAAGCAGATTGGGAGAGAA-3’ and 
5'-AATACTGCAGTTTGATCCGACAAAATAACTCTGTT-3’ and cloned with 
KpnI/Pstl sites into the pGREEN800II-60LUC. GRP23 promoter was amplified 
with primers 5’-AACAGGTACCCAGGTGTGATTGTCAATAGACTACG-3’ and 
5'-AACAGATATCGGTGGAGGGAAAATGATTTAGGGTT-3’ and cloned with 
KpnI/EcoRV sites into the pGREEN800II-60LUC. TCP9 promoter was amplified 
with primers 5'’-AACAGGTACCGTATGCTGATGGTAGGCAAAAGTT-3’ and 
5'-AATACTGCAGTAAAATATAGCTGAGAGAAAACG-3’ and cloned with KpnI/ 
PstI sites into the pGREEN800II LUC. The different reporter constructs (dual- 
luciferase reporter with different gene promoters) and indicated effectors (empty 
effector vector or WUS or HAM2, or WUS together with HAM2) were introduced 
into N. benthamiana leaves through Agrobacterium infiltration. The activities of 
LUC and REN were quantified 2 days after infiltration with a Dual Luciferase Assay 
kit (Promega), and luminescence was recorded using a 96-well dual injection lumi- 
nometer (Tecan). The LUC activity was normalized to the REN activity (LUC/REN). 
The means and standard errors of LUC/REN were calculated from three indepen- 
dent biological replicates. 
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Plasmid constructions for the transgenic plants. It has been previously reported 
that HAM1, HAM2 and HAM3 are targeted and repressed by the MIRI71 family”’. 
To generate new microRNA-sensitive fluorescence reporters for HAM1, HAM2 
and HAM3, an approach similar to that in a previous report** was used. Briefly, a 
2X YPET-N7mirS fragment was generated through PCR amplification, which con- 
tains a2 version of YPET with a N7 nuclear localization sequence (2 YPET-N7) 
followed by 26 bp of microRNA target sequence (GCAAGGGATATTGGCGCGG 
CTCAATC) from the HAM family. These 26 bp are recognized and targeted by the 
MIRI171 family’?**. 

For the construction of the pHAM1::2x YPET-N7mirS reporter, a 4kb Ascl 
fragment containing the HAM1 promoter was amplified from Col-0 genomic 
DNA with primers 5'-TACAGGCGCGCCTTTCCCTCACTTTTTCTTACATT 
GAA-3' and 5'-TACAGGCGCGCCACGCCTCCTCAACAACACAGAGTAA-3’ 
(restriction enzyme sites are in bold), and cloned 5’ of the 2x YPET-N7mirS frag- 
ment. The fused DNA fragment was introduced into the pMOA34 binary vector’. 
For the construction of pHAM2::2X YPET-N7mirS, the 3,122 bp HAM2 promoter 
was amplified with 5'-TACAGTTTAAACAGCAGGACATATCTAAACCAGA 
AGTT-3’ and 5'-TACAGTTTAAACGACCAATCTTACAGAGTCAGAAAGA 
G-3’ (restriction enzyme sites are in bold) and cloned in front of 2X YPET-N7mirS; 
and the 1,149 bp HAM2 3’ untranslated sequence was PCR amplified with 5’-TAC 
AGGCGCGCCGACGAAAAAGGAGGATATTTTCACGGT-3’ and 5’-TACAG 
GCGCGCCACTATGTTTCCATGTACTGTGGGATA-3’ (restriction enzyme sites 
are in bold) and cloned 3’ of the 2X YPET-N7mirS construct, then the fused DNA 
fragment was cloned into pMOA34. For the construction of pHAM3::2X YPET- 
N7mirS, the 3,816 bp HAM3 promoter was amplified with 5'-TACAGTTTAAAC 
TTTATAAGACTTGCTATGGTCGTGAG-3’ and 5'-TACAGTTTAAACTGCA 
GACGATAAAAAATAGTGTATT-3’ (restriction enzyme sites are in bold) and 
cloned before 2X YPET-N7mirS; and the 1,755 bp HAM3 3’ untranslated sequence 
was PCR amplified with 5’-TACAGGCGCGCCTTTCCACCGGAGTTTCAATT 
ATTAAA-3’ and 5'-TACAGGCGCGCCTTAGTTGAAGGACAAATAACACCA 
AA-3’ (restriction enzyme sites are in bold) and cloned 3’ of the 2X YPET-N7mirS 
fragment, then the fused DNA fragment was introduced into pMOA34. The double 
reporter lines, including the p WUS::DsRed-N7; pHAM1::2X YPET-N7mirS line and 
the pWUS::dsRed-N7; pHAM2::2X YPET-N7mirS line, were generated through 
genetic crosses. 

For the construction of the pHAM4::2X YPET-N7 reporter, the 6,413 bp HAM4 
promoter was amplified with primers 5’-TACAGGCGCGCCAAATATAAAAT 
AGAATCAAACAAAGTTGGTAAC-3’ and 5'-CAAAGGCGCGCCGTGTTGT 
GTGTTAAGAAGAAAGAAAGGTGGAGCCTTT-3’ (restriction enzyme sites are 
in bold), and cloned 5’ of a 2X YPET-N7 fragment, then the fused DNA fragment 
was cloned into pMOA34. 

For the complementation of wus-1, a full-length WUS or WUS derivative with- 
out base pairs encoding amino acids from 203 to 236 was cloned into the pMOA36 
binary vector, together with 4.4 kb of the WUS upstream regulatory sequence and 
1.5kb of the WUS 3’ untranslated sequence. The construct was introduced into 
wus-1/+ plants using the floral dip method. For the complementation of ham1;2;4, 
HAM1 or the HAM1 derivative without 117-230 was cloned into the pMOA34 binary 
vector, with 3,949 bp of the HAM] upstream regulatory sequence and 1,387 bp of 
the HAM1 3’ untranslated sequence. The construct was introduced into ham1;2;4 
plants using the floral dip method. 

To generate a MIR171 expression construct in shoot meristems, the MIR171 
DNA was amplified with 5’-CACCTGAGCGCACTATCGGACATCAAA-3’ and 
5'-TAAACGCGTGATATTGGCAC-3’ and cloned into pMOA36 together with 
4.4 kb of the WUS upstream regulatory sequence and 1.5 kb of the WUS 3’ untrans- 
lated sequence. The construct was introduced into the ham4 mutant through the 
floral dip method. Five independent transgenic plants (p WUS::MIR171 in ham4) 
showing terminated vegetative meristems were identified. 

Confocal imaging of fluorescence reporters in living plants. All of the fluor- 
escent reporters were imaged by using a Zeiss LSM 510 Meta confocal microscope, 
except for the fluorescent reporters in inflorescence meristems and HAM2 fluor- 
escent reporters in the roots, which were imaged by using a Zeiss LSM 780 Meta 
confocal microscope. Zeiss LSM software was used for reconstructing the Z-stacks 
for a projection view. Laser and filter settings were used as described previously****. 
To image HAM4 and WOX¢4 reporters, the cotyledons, first leaf, hypocotyls and 
roots from 7-day-old seedlings and stems from 1-cm bolting plants were used. To 
image dsRed, YPET and PI simultaneously in SAMs, the multitracking mode in 
the ZEISS LSM 780 was used. dsRed was excited using a 561 nm laser line in con- 
junction with 571-589 nm collection; YPET was excited using a 514 nm laser line 
in conjunction with a 519-549 nm collection; and PI was excited using a 514nm 
laser with 631-673 nm collection. There is no spectral bleed-through of dsRed 
into the YPET collection channel, nor of YPET into the dsRed collection channel 
under these settings, and for better display, all images from the dsRed channel were 


equally enhanced with the same scale and all images from the PI channel were 
uniformly enhanced to the similar intensity using ImageJ software. 

Histology. The wild-type and ham1;2;3;4 seedlings were fixed in 4% paraformal- 
dehyde, dehydrated and embedded in Paraplast X-tra (Fisher). The samples in wax 
were sectioned at 8 jtm, de-waxed and dehydrated, and the slides were stained with 
Alcian blue together with Safranin O (red) as previously described”, to detect non- 
lignified cell walls and lignified cell walls, respectively. 

Real-time RT-PCR analysis. Total RNA was isolated from 10-day-old plants with 
roots, hypocotyls and leaves dissected off, using the RNeasy Kit (Qiagen). Super- 
Script III reverse transcriptase (Invitrogen) was used to synthesize the first-strand 
cDNA with oligo(dT) primer and 1 1g of total RNA at 50 °C for 1 h. Quantitative 
PCR was then performed with the SensiMix SYBR Hi-ROX Kit (Bioline) on Roche 
Real-Time PCR machine following the manufacturer’s instruction. The thermal 
cycling program was 95 °C for 10 min, followed by 45 cycles of 95 °C for 10 s, 56 °C 
for 30 s, 72 °C for 40 s, anda one-cycle dissociation stage at 95 °C for 15 s, 60 °C for 
1 min, and 97 °C for 15s. The primers used in quantitative RT-PCR were: JAZ5, 
5'-GAAAGACAGAGCTGTGGCTAGG-3’ and 5’-TTGGCCTTCTTCAATCTT 
CATAATA-3’; TIP2;2, 5'-ACCAATGGCGAGAGCGTACCG-3’ and 5’-ATGA 
AACCGATAGCAATTGGAG-3’; TCP9, 5’-ACCTCCTTTACAAGTTGTTCCA 
AG-3' and 5'-TGAAGCTCTTGTTTCTCGTATATCTC-3’; GRP23, 5'-AGACA 
GCTAGCCATCAGCAGTCAC-3’ and 5'-AGTTCCTCAACTCCACTACCTTT 
TT-3'; TPL, 5'-AGCTAGTCTCAGCAATTCAAA-3’ and 5’-AGGCTGATCAG 
ATGCAGAGG-3’; and UBQ10, 5’-AACAATTGGAGGATGGTCGT-3’ and 5’-T 
TCCAGGGAAGATGAGACG-3’. Fold change was calculated as 2“ and stan- 
dard error was calculated from three biological replicates, and each biological repli- 
cate was examined in triplicate. 

ChIP. For the construction of pHAM2:: YFP-HAM2 (pHAM2::YPET-HAM2), the 
YFP variant YPET was amplified and cloned in front of HAM2 cDNA in-frame to 
generate the YFP-HAM2 fragment. Then the 3,122 bp HAM2 promoter was amp- 
lified with 5’-TACAGTTTAAACAGCAGGACATATCTAAACCAGAAGTT-3’ 
and 5'-TACAGTTTAAACGACCAATCTTACAGAGTCAGAAAGAG-3’ (restric- 
tion enzyme sites are in bold) and cloned in front of YFP-HAM2, and the 1,149 bp 
HAM23' untranslated sequence was amplified with 5’-TACAGGCGCGCCGAC 
GAAAAAGGAGGATATTTTCACGGT-3’ and 5’-TACAGGCGCGCCACTAT 
GTTTCCATGTACTGTGGGATA-3’ and cloned 3’ of YFP-HAM2. Then the whole 
fused DNA fragment (pHAM2-YFP-HAM2-HAM2 3' UTR) was cloned into the 
binary vector pMOA34. The construct was introduced into ham1;2;4 plants using 
the floral dip method, and the complemented ham 1;2;4 [pPHAM2:: YFP-HAM2] line 
was selected for the western blot, GFP immunoprecipitation (shown in Extended 
Data Fig. 6f) and ChIP experiments. 

A ChIP followed by a quantitative real-time PCR approach was used to investigate 
the in vivo association of HAM2 with the TPL and GRP23 promoters as described 
previously”? with some modifications. In general, 2 g of ham1;2;4 (negative con- 
trol) or ham1;2;4 [pPHAM2::YFP-HAM2] plants were harvested and fixed with 1% 
formaldehyde under vacuum. Nuclei were isolated and lysed, and chromatin was 
sheared to an average size of 500 bp by sonication seven times for 20s each with a 
Branson Sonifier. Samples were kept on ice during sonication and were cooled for 
1 min between sonication pulses. The sonicated chromatin served as input. Immu- 
noprecipitations were performed with GFP-Trap Agarose beads (Chromotek) at 
4°C following the manufacturer’s procedure. The precipitated DNA was isolated 
and purified, and served as a template for PCR. Quantitative PCR was performed 
as described earlier. The relative enrichment for each immunoprecipitated ampli- 
con (from TPL or GRP23 promoter) from GFP-Trap is presented as ChIP/input 
ratio, and TUA4 and ACTIN7 (ACT7) amplicons are also included to serve as nega- 
tive controls. The ChIP experiments were conducted three times using independent 
biological replicates with similar results, and one representative data set with two 
technical replicates is presented. The primer pairs used in ChIP-PCR areas follows: 
TPL amplicon 1, 5'-GCAATTGGCTCTTCAATGTC-3’ and 5'-GGACGGAGAT 
CTAACGGCTA-3’'; TPL amplicon 2, 5’-CCATATGACCGGGATATGAGA-3’ 
and 5’-GGGATATGTCGCTTTCCATT-3’; TPL amplicon 3, 5’-TTGAGTCAGG 
GCTCATCTCC-3’ and 5'-CTTTCGCGAGAACCAACTTC-3’; GRP23 amplicon 
1, 5’-ACCATCGTCATTGGTTTCGT-3’ and 5’-GGAGGTGACTGAGAGACA 
TGG-3'; GRP23 amplicon 2, 5'-CAACAAATTCCTGTTTTCACGTT-3’ and 5’-C 
GAAAATGTTCGAACTGCAT-3’; GRP23 amplicon 3, 5’-CGCCATCGCCTAA 
AAGTAAA-3’ and 5’-TTTGTTGGCTAGGCATAGGG-3’; GRP23 amplicon 4, 
5'-AGACAGCTAGCCATCAGCAGTCAC-3’ and 5'-AGTTCCTCAACTCCA 
CTACCTTTTT-3’; TUA4 amplicon, 5'-CTTTGGTCTTTAGCAGGTTC-3’ and 
5'-CCCATCTGTATATAACGACAC-3’; ACTIN7 amplicon, 5'-TGCTTGTTAT 
GTGATTCGATCC-3’ and 5’-GATCGACAGAAGCGAGAAGAAT-3’. 
Staining. mPS-PI staining and root imaging of the staining was performed as pre- 
viously described". 

Scanning electron microscopy. For scanning electron microscopy, tissue was placed 
in 1.2% glutaraldehyde in 0.025 M phosphate buffer (sodium phosphate, pH 6.8), 
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vacuum was applied for 10 min, and tissue was fixed overnight at 4 °C. Tissue was 
then rinsed twice with 0.025 M phosphate buffer for 1 h, post-fixed with 0.5% osmium 
tetroxide in 0.025 M phosphate buffer for 24h at room temperature, and moved 
through an increasing ethanol series (20% increments), each increment lasting a 
minimum of 1h and ending with two exchanges of 100% ethanol. Ethanol was 
removed by critical point drying with a critical point drier (SAMDRI), and tissue 
was mounted to stubs with double-sided adhesive tape and sputter coated with gold- 
palladium alloy using a Hummer Sputtering System (Anatech). Samples were exam- 
ined with a Hitachi 4700 scanning electron microscope. 

Primers used for cDNA clones and deletion constructions. HAM 1c/5CACC, 
5'-CACCATGCCCTTATCCTTTGAAAGGTTTCAAGG-3’; HAM1c/3, 5'-ACA 
TTTCCAAGCAGAGACAGTAACAAG-3’; HAM1c5/231, 5'-CCGTTTTATCAC 
AACAACCAG-3'; HAM1c5/441, 5'-GAAAATCTCAAAACATTCG-3'; HAMID71- 
116/5, 5'-AGTCCTCTCGCTTCTTATTCTGCTTCTITCTCCTGGTCAAGAGC-3’; 
HAMID71-116/3, 5'-GCTCTTGACCAGGAGAAGAAGCAGAATAAGAAGCG 
AGAGGACT-3'; HAM1c71/5CACC, 5'-CACCATGTCTACCACCACCACGCT 
GTCTTCCTCT-3’; HAM1D117-230/5, 5'-GATGATCTTGACGGTGTTCTCT 
CTCCGTTTTATCACAACAACCAGCAA-3’; HAM1D117-230/3, 5'-TTGCTGG 
TIGTTGTGATAAAACGGAGAGAGAACACCGTCAAGATCATC-3’; HAM4c/5, 
5'-ATGAAAATCCCTGCATCATCTCCTC-3’; HAM4c/3, 5'-AAACCGCCAA 
GCTGATGTGGCAACA-3’; WUSc5/1, 5'-ATGGAGCCGCCACAGCATCAG-3’; 
WUSc5/30, 5'-TACACGTGTCGCCAGACCAG-3'; WUSc5/100, 5’-AGATTCA 
ACGGAACAAACATGAC-3’; WUSc5/171, 5'-GCAAGCTCAGGTACTGAATG 
T-3'; WUSc/3stop, 5’-CTAGTTCAGACGTAGCTCAAGA-3'; WUSc3/236, 5'-A 
CCTTCTAGACCAAACAGAGG-3’; WUSc3/292, 5'-GITTCAGACGTAGCTCA 
AGAGAAGC-3'; WUSD164-183/5, 5'-TAACAAGCCATATCCCAGCTTCAAT 
GGCTACATGAGTAGCCATG-3’; WUSD164-183/3, 5'-CATGGCTACTCATG 
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Extended Data Figure 1 | Interaction between WUS/WOX and HAM family 
transcriptional regulators. a, LacZ activity in yeast-two-hybrid assays testing 
interactions between WUS and HAM2, HAM3 or HAM4. Error bars 

show mean + s.e.m. (n = 3 biological replicates). **P < 0.01; ***P < 0.001 
(two-tailed t-test, compared with DBD-WUS/AD). b-o, BiFC analyses in 
tobacco transient assays with HAM and WOX family genes. c—q, Tobacco was 
co-transformed with GFPn—-WUS and GFPc-HAM3 (b), GFPn—WUS and 
GFPc-HAM4 (c), GFPn—WUS and GFPc-FAMA (d), GFPn—WUS and 
GFPc-BARD1 (e), GFPn—WOX4 and GFPc-HAM1 (f), GFPn-WOX4 


GFPn-WUS/ 
GFPc-HAM4 


GFPn-WUS/ 
GFPc-BARD1 


GFPn-WOxX4/ 
GFPc-FAMA 


GFPn-WOX5/ 
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GFPn-WOX5/ 
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and GFPc-FAMA (g), GFPn—FAMA and GFPc-HAM1 (h), GFPn—WOX5 
and GFPc-HAM1 (i), GFPn—WOXS5 and GFPc-HAM2 (j), GFPn—WOXS and 
GFPc-HAM4 (k), GFPn—WOX5 and GFPc-FAMA (1), GFPn—-WOX5 and 
GFPc-BARD1 (m), GFPn—BARD1 and GFPc-HAMI (n), GFPn-BARD1 
and GFPc-HAM2 (0), GFPn-BARD1 and GFPc-HAM4 (p), or GFPn-FAMA 
and GFPc-HAM4 (q). BARD1 and FAMA proteins are both included as 
negative controls. Left panel: GFP channel; middle panel: propidium iodide 
(PI) staining channel; right panel: merged channels. Scale bars, 20 um. 
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Extended Data Figure 2 | An N-terminal region of HAM] is important for 3’ untranslated region (UTR) (c, f), but was fully complemented by wild-type 
WUS-HAM.1 interaction and is essential for HAM1 function in stem cell HAM 1 (d, g). b, c, Arrows indicate the early terminated inflorescences. 


maintenance. a, Yeast two-hybrid assay of interactions between WUS and h-j, Amino acid sequence alignment of the HAM1 N-terminal domains 
various deleted derivatives of HAM1. Deleting amino acids 117 to 230 (117-230) using Clustal Omega. h, Sequence alignment of the N-terminal 
(D117-230) from HAM1 compromised the WUS-HAM1 interaction. domains among three Arabidopsis HAM members. i, Sequence alignment of 
Left, box diagrams of the HAM1 derivatives. Shaded boxes indicate the partial N-terminal domains in HAM from A. thaliana, A. lyrata, Capsella 
GRAS domains. Numbers indicate amino acid residues. Error bars rubella, Brassica rapa and Petunia. j, Sequence alignment of partial HAM1 
show mean + s.e.m. (” = 3 biological replicates). *P < 0.05, **P < 0.01, N-terminal domains in HAM from A. thaliana, A. lyrata, C. rubella, B. oleracea, 
*** P< 0,001 (two-tailed t-test, compared with full-length AD-HAM1). B. rapa and Petunia. Asterisks indicate amino acids that are the same; dots 
b-g, The complementation of the ham1;2;4 triple mutant requires amino acids _ indicate similar amino acids. The conserved regions are boxed. Scale 
117-230. The early termination phenotype of ham1;2;4 (b, e) was not bars: 10 mm (b, c, g); 40 mm (d); 20 mm (e, f). 


complemented by HAM1(D117-230) driven by a HAMI1 promoter and 
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Extended Data Figure 3 | A C-terminal region of WUS is important for 
WUS-HAM1 interaction and is essential for WUS function in stem cell 
maintenance. a, Yeast-two-hybrid assay of interactions between HAM1 and 
various deleted WUS derivatives. Deleting amino acids 203 to 236 (D203-236) 
from WUS greatly compromised the WUS-HAM1 interaction. Left, box 
diagrams of the deleted WUS derivatives. Shaded boxes indicate the 
homeodomain; the three black boxes indicate the acidic domains, the WUS box 
and the EAR motif, respectively. Numbers indicate amino acid residues. Error 
bars show mean + s.e.m. (n = 3 biological replicates). *P < 0.05, **P < 0.01, 
***P < 0.001 (two-tailed t-test, compared with DBD-WUS full-length). 
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b-d, WUS function requires the same region that is important for WUS- 
HAM interaction. The terminated shoot meristem phenotype of wus-1 

(b) was not complemented by WUS(D203-236) driven by the WUS promoter 
and 3’ UTR (c), and was fully complemented by the wild-type WUS (d). 

e, Amino acid sequence alignment of C-terminal regions of WUS from 

A. thaliana, A. lyrata, C. rubella, B. oleracea, B. rapa, Lepidium ruderale, 

L. sativum and Petunia, using Clustal Omega. Asterisks indicate amino acids 
that are the same; dots indicate similar amino acids. The conserved regions are 
boxed. Scale bars, 2 mm (b-d). 
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Extended Data Figure 4 | Genetic interaction between WUS and HAM 
family members. a, b, The secondary inflorescence meristems initiated from 
axillary meristems in wus-7;ham1;2 homozygotes with ham3/+ terminate 
prematurely. c, wus-7;ham1;2 homozygotes with ham4/+ display early 
termination of the main inflorescence meristem and lack of carpels in flowers 
(indicated by arrow). d-f, WUS and HAM family members interact genetically 
in a dose-dependent manner. wus-7 (d) formed functional shoot apices and 
normal stature, but wus-7; ham1/+; ham2/+; ham3/+ (e) enhanced the wus-7 
phenotype, and wus-7; ham1/+; ham2; ham3 (f) showed stronger 
enhancement, with reduced flower numbers and plant stature, and an 


elongated vegetative stage, resembling a wus strong allele. Plants are at 36 days 
after germination. g, h, Downregulation of HAM1, HAM2 and HAM3 in ham4 
shoot meristems leads to an early termination phenotype. Compared to wild 
type (Col) (g), pWUS::MIR171 in ham4 (h) showed terminated vegetative 
meristems. i-l, WUS is required for the functions of HAM1, HAM2 and 
HAM3. At 11 days after germination, compared with Ler wild type (i) and 
ham1;2;3 (k), which formed functional vegetative meristem and leaf primordia, 
wus-1; ham1;2;3 (1) displays terminated vegetative meristems similar to 

wus-1 (j). Scale bars, 2mm. 
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Extended Data Figure 5 | Expression of HAM1, HAM2 and WUS in the 
SAMs. a, b, WUS expression in clv3-2. Orthogonal (a) and top (b) views of 
pWUS::DsRed-N7 expression (red) and chlorophyll autofluorescence (blue) in 
the same clv3-2 inflorescence meristem. c—h, Comparison between expression 
patterns of HAM1, HAM2 and WUS in vegetative meristems. c, Orthogonal 
view of pHAM1::2X YPET-N7mirS expression (green) in Ler vegetative 
meristem. d, Orthogonal view of pHAM1::2X YPET-N7mirS expression (green) 
together with chlorophyll autofluorescence (red) in the same vegetative 
meristem shown in ¢, indicating that HAM1 is expressed in the rib meristem. 
e, Orthogonal view of pHAM2::2X YPET-N7mirS expression (green) in Ler 
vegetative meristem. f, Orthogonal view of pHAM2::2X YPET-N7mirS 
expression (green) together with chlorophyll autofluorescence (red) in the 
same vegetative meristem shown in e, indicating that HAM72 is highly expressed 
in the rib meristem. g, Orthogonal view of pWUS::DsRed-N7 expression (red) 
in Ler vegetative meristem. h, Orthogonal view of p WUS::DsRed-N7 expression 
(red) together with chlorophyll autofluorescence (blue) in the same vegetative 
meristem shown in g, indicating that WUS is expressed in the rib meristem. 
Arrows indicate the positions of the L1 cell layer. i-p, Control images 
confirming the specificity of confocal spectral settings for Fig. 3 (e-l). The 
SAMs from the pWUS::DsRed-N7 line (i-l) or pHAMI::2X YPET-N7mirS line 
(m-p) were imaged from the same three separated channels used in Fig. 3 (e-I). 
There is no spectral bleed-through of YPET signal into the dsRed channel (m), 
nor dsRed signal into the YPET channel (j). i, m, dsRed channel (red); 

jn, YPET channel (green); k, 0, PI staining channel (grey); 1, p, merged all three 
channels. Scale bars: 50 jm (a-d, g, h); 20 tm (e, f, i-p). 
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Extended Data Figure 6 | pHAM2::YFP-HAM2 (pHAM2::YPET-HAM2) 
complemented the ham1;2;4 mutant and was expressed in the centre of 
SAMs. a-c, The early termination phenotype of ham1;2;4 (a, b) was completely 
complemented by YPET-HAM2 driven by the HAM2 promoter and 

3' UTR (c), indicating that the promoter used for HAM2 transcriptional and 
translational reporters is functional and that the fusion protein (YPET-HAM2) 
is also functional in vivo. a, b, Arrows indicate early terminated apices. 

a-—c, Scale bars, 10 mm. d, e, Different Z sections from the same SAM from a 
ham1;2;4 [pPHAM2::YPET-HAMZ] plant depicted in Fig. 3m, n shows 
expression of pHAM2::YPET-HAM2 translational marker (green) in L2 (d) 
and L3 (e), together with PI as counter stain (red). d, e, Scale bars, 20 jum. 

f, Immunoblot with anti-GFP antibody validates the presence of 
YFP-HAM2 (YPET-HAM2) in both nuclear lysate and nuclear proteins 
immunoprecipitated with GFP-Trap from ham1;2;4 [pHAM2::YFP-HAM2] 
line used in ChIP experiment (Fig. 2n, 0). IB, immunoblot; IP, 
immunoprecipitation. 
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Extended Data Figure 7 | Expression patterns of HAM genes in comparison 
with WOX4. a, pHAM4::2X YPET-N7 (green, indicated by arrow) is expressed 
in procambium cells of the first leaf. b, pHAM4::2X YPET-N7 (green, 
indicated by arrow) is expressed in vasculature in the 7-day-old hypocotyl. 
c-h, Comparison of pHAM4::2X YPET-N7 (green, indicated by arrow) and 
pWOX4::YFP (green, arrow indicated) expression patterns in vasculature cells 
in the 7-day-old leaf petiole (c, d), 20-day-old leaf petiole (e, f) and 7-day-old 
root (g, h). i, Orthogonal view of pHAM4::2X YPET-N7 (green, indicated by 
arrow) expression in flower vasculature. j, Procambium-specific expression of 
pHAM4::2X YPET-N7 in stems from 1-cm bolting plants. k-], Procambium- 
specific expression of pHAM3::2X YPET-N7mirS (k) and pHAM1::2X YPET- 
N7mirS (1) in transverse sections of stems from 1-cm bolting plants. Red 
represents chlorophyll autofluorescence (a-f, i-l), or PI staining (g, h). Scale 
bars: 50 tum (a, h-i, k-1); 100 tum (b-g, j). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


pHAM2::YPET-HAM2 in ham1;2;4 


PHAM2::YPET-HAM2 in ham1;2;4 PHAM2::YPET-HAM2 in ham1;2;4 PHAM2::YPET-HAM2 in ham1;2;4 


Extended Data Figure 8 | Expression patterns of HAM2transcriptionaland Complete stacks of confocal sections through the root tip demonstrate that 
translational reporters in root meristems. a-i, Complete stacks of confocal pHAM2:YPET-HAM2 (green) is present in the quiescent centre cells (indicated 


sections through the root tip demonstrate that pHAM2::2X YPET-N7mirS by arrows) and the cells above the quiescent centre within the root meristem in 
(green) is expressed in the quiescent centre cells (indicated by arrow) and in the ham1;2;4 mutant. Cellular outlines were stained with PI (red). Scale 
cells above the quiescent centre within the root meristem. j-o, Expression bars: 20 [um (a-i); 50 Lum (j-o). 


patterns of HAM2 translational reporters in ham1;2;4 root meristems. 
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Extended Data Figure 9 | HAM family regulates various stem cell niches. 
a-d, Growth arrest of ham1;2;3;4 at the seedling stage. a, b, Imaging of Ler 
wild-type (a) and homozygous ham1;2;3;4 (b) seedlings at 7 DAG. c, d, Imaging 
of wild-type (c) and homozygous ham1;2;3;4 (d) (indicated by arrow) 
seedlings at 26 DAG. e, f, Transverse section of leaves from wild-type (e) and 
ham1;2;3;4 (f) at 7 DAG. f, Arrow indicates undifferentiated/undetermined 
cell mass. g, h, Confocal imaging of root meristem from wild-type (g) and 
ham1;2;3;4 (h) seedlings at 7 DAG. ham1;2;3;4 displayed enlarged cells with 
abnormal shapes at the quiescent centre (indicated by arrows) and CSC 
positions. g, h, Cellular outlines were visualized with PI staining (white). 

i-l, mPS-PI" stains indicate that HAM genes regulate root cell differentiation. 
Some CSCs (arrow indicated) undergo differentiation with starch accumulated 
and stained in homozygous ham1;2;3 (j, 1), but none of them can be stained 
in Ler wild type (i, k). Asterisks mark the quiescent centre cells. Scale 

bars:5 mm (c, d); 1 mm (a, b, e, f), 20 tum (g-I). 
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Extended Data Figure 10 | Interaction between WOX and HAM 


homologues from tomato (Solanum Iycopersicum). a, b, f, BiFC analyses 
in tobacco transient assays demonstrated that tomato WUS (NCBI gene 
accession number 543793) physically interacted with a putative tomato HAM 


homologue (sequence accession number: LEFL2052P11 from Kazusa 


Full-length Tomato cDNA database) (a) identified based on its sequence 
homology to HAM from Arabidopsis and Petunia (f), and that tomato WOX4 


(ref. 10) (NCBI gene accession number 100301933) physically interacted with 
the putative tomato HAM homologue (b). c-e, BARD 1 protein is included 
as a negative control. Left panel: GFP channel; middle panel: PI staining 
channel; right panel: merged channels. Scale bars, 20 um. f, Amino acid 
sequence alignment of a putative tomato HAM, Arabidopsis HAM1 and 
Petunia HAM using Clustal Omega. Asterisks indicate amino acids that are 
the same; dots indicate similar amino acids. 
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Broad CTL response is required to clear latent HIV-1 
due to dominance of escape mutations 
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Despite antiretroviral therapy (ART), human immunodeficiency virus 
(HIV)-1 persists in a stable latent reservoir’, primarily in resting 
memory CD4* T cells**. This reservoir presents a major barrier to 
the cure of HIV-1 infection. To purge the reservoir, pharmacolog- 
ical reactivation of latent HIV-1 has been proposed’ and tested both 
in vitro and in vivo’ *. A key remaining question is whether virus- 
specific immune mechanisms, including cytotoxic T lymphocytes 
(CTLs), can clear infected cells in ART-treated patients after latency 
is reversed. Here we show that there is a striking all or none pattern 
for CTL escape mutations in HIV-1 Gag epitopes. Unless ART is 
started early, the vast majority (>98%) of latent viruses carry CTL 
escape mutations that render infected cells insensitive to CTLs directed 
at common epitopes. To solve this problem, we identified CTLs that 
could recognize epitopes from latent HIV-1 that were unmutated in 
every chronically infected patient tested. Upon stimulation, these 
CTLs eliminated target cells infected with autologous virus derived 
from the latent reservoir, both in vitro and in patient-derived human- 
ized mice. The predominance of CTL-resistant viruses in the latent 
reservoir poses a major challenge to viral eradication. Our results 
demonstrate that chronically infected patients retain a broad-spectrum 
viral-specific CTL response and that appropriate boosting of this 
response may be required for the elimination of the latent reservoir. 

HIV-1 establishes latent infection in resting CD4* T cells*“. Recent 
efforts to eradicate HIV-1 infection have focused on reversing latency 
without global T-cell activation’. However, inducing HIV-1 gene expres- 
sion in latently infected cells is not sufficient to cause the death of these 
cells if they remain in a resting state’. Boosting HIV-1-specific immune 
responses, including CTL responses, may be required for clearance of 
the latent reservoir’. CTLs have a significant role in suppressing HIV-1 
replication in acute infection’® *. Because of this strong selective pres- 
sure, HIV-1 quickly acquires mutations to evade CTL recognition’**""". 
CTL escape has been studied primarily through the analysis of plasma 
virus’*'*1618-0, and CTL-based vaccines have been designed based on 
conserved epitopes*’”*. A systematic investigation of CTL escape in the 
latent reservoir will be of great importance to the ongoing CTL-based 
virus eradication efforts, because latent HIV-1 probably represents the 
major source of viral rebound after treatment interruption. Earlier 
studies have suggested the presence of CTL escape mutations in pro- 
viral DNA!*”, but it remains unclear to what extent the latent reservoir 
in resting CD4* T cells is affected by CTL escape, whether mutations 
detected in proviral DNA are representative of the very small fraction 
of proviruses that are replication competent, and, most importantly, 
whether the CTL response can recognize and clear infected cells after 
latency is reversed. 


To investigate CTL escape variants in the latent reservoir, we deep 
sequenced the proviral HIV-1 DNA in resting CD4* T cells from 25 
patients (Extended Data Table 1). Among them, 10 initiated ART during 
the acute phase (AP; within 3 months of infection) while the other 15 
initiated ART during the chronic phase (CP) of infection. The sequenc- 
ing was focused on Gag because it is an important target of the CTL re- 
sponse” and is highly conserved, which facilitates the detection of escape 
variants. Our data show that previously documented CTL escape var- 
iants completely dominate the viral reservoirs of nearly all CP-treated 
patients (Extended Data Fig. 1 and Supplementary Table 1). This trend is 
especially obvious for several well characterized CTL epitopes: the human 
leukocyte antigen (HLA)-A2-restricted epitope SLYNTVATL (SL9), the 
HLA-A3-restricted epitope RLRPGGKKK (RK9) and the HLA-B57/58- 
restricted epitope TSTLQEQIGW (TW 10) (Fig. la and Extended Data 
Fig. 1, highlighted in coloured boxes.). In these epitopes, close to 100% 
of the sequences harboured escape mutations. Comparison of muta- 
tion frequencies between HLA allele-relevant and -irrelevant epitopes 
in CP-treated patients suggests that the CTL escape mutations identified 
are specific to each patient’s HLA type (Fig. 1b). By contrast, except for 
SL9 from patient APO1 and RK9 from patient AP08, few if any CTL 
escape mutations were archived in AP-treated patients (Fig. 1c and Ex- 
tended Data Fig. 1). The striking difference between AP- and CP-treated 
patients (Fig. 1c) indicates that, unless treatment is initiated within the 
first several months of infection, the latent reservoir becomes almost 
completely dominated by variants resistant to dominant CTL responses. 

To confirm that variants detected at high frequency in the latent res- 
ervoir represent functional CTL escape mutants, cells from seven CP- 
treated subjects were tested for reactivity to synthetic peptides representing 
wild-type and mutant versions of the relevant epitopes. As expected, 
there were only minimal responses to previously documented CTL escape 
mutants by patient CD8* T cells, and no de novo response was detected 
(Fig. 1d and Extended Data Fig. 2). In contrast, all tested subjects retained 
a strong response to peptides representing the wild-type epitopes, sug- 
gesting that the wild-type virus was initially transmitted, with subsequent 
evolution of CTL escape variants. Most HIV-1 proviruses detected in 
patients are defective’. Therefore, to determine whether these CTL 
escape variants can be reactivated and lead to viral rebound if therapy 
is stopped, we isolated replication-competent viruses from the latent 
reservoirs of nine CP-treated patients. We found that all the dominant 
CTL escape mutations that had been identified in proviruses in resting 
CD4* T cells were also present in the replication-competent viruses that 
grew out after T-cell activation (Fig. le and Extended Data Fig. 3), indicat- 
ing that these CTL escape variants not only dominate the population of 
proviruses, but can also be released and replicate once latency is reversed. 
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HIV-1 Gag CTL epitopes 


Figure 1 | CTL escape variants dominate the latent reservoir of CP-treated 
but not AP-treated patients. a, Frequency of variants in Gag CTL epitopes 
in proviruses from resting CD4~ T cells. Representative results from six 
patients are shown. Only optimal CTL epitopes relevant to each patient’s HLA 
type are listed. Results from both PacBio (left bar) and MiSeq (right bar) 
sequencing are shown. The effect on CTL recognition (denoted by colour) is 
determined from information in the Los Alamos National Laboratory (LANL) 
HIV Molecular Immunology Database. b, CTL escape variants identified by 
sequencing are specific to HLA type. Frequencies of documented escape- 
associated variants in four well characterized epitopes are shown for all 15 
CP-treated patients. Medians and P values from Mann-Whitney test are 
shown. ¢, Comparison of CTL escape variant frequency in proviruses between 
CP- and AP-treated patients. Only well characterized epitopes are shown. 


We next asked whether the host CTL response could recognize and 
eliminate the cells infected with these escape variants. We infected acti- 
vated CD4* T cells from these patients with autologous, replication- 
competent virus derived from the latent reservoir (Extended Data Fig. 4a). 
The infected cells were then co-cultured with autologous CD8* T cells, 
either unstimulated or pre-stimulated, to assess HIV-1-specific cyto- 
lytic activity. Non-specific activation of CD8* T cells was not observed 
after co-culture with phytohaemagglutinin (PHA)-activated CD4* T 
cells (Extended Data Fig. 4b). From all 13 CP-treated subjects tested, 
CD8* T cells pre-stimulated by a Gag peptide mixture efficiently killed 
autologous infected CD4° T cells (median 61% elimination), while 
unstimulated CD8* T cells from most subjects had significantly less 
effect (median 23% elimination) (Fig. 2a and Extended Data Fig. 4c, d). 
CD8* T cells from 7/7 healthy donors completely failed to eliminate 
autologous infected cells (Fig. 2a), confirming that the observed killing 
was HIV-1 specific. The killing effect was enhanced by increasing the 
effector to target ratio (Extended Data Fig. 5a), and was cell-cell contact 
dependent (Extended Data Fig. 5b). When the co-culture was main- 
tained over time in the absence of ART, viral replication was significantly 
reduced, but not completely inhibited by pre-stimulated CD8™ T cells 
(Fig. 2b). We found that peptide mixtures from other HIV-1 proteins 
(Nef, Tat, Rev and Env) could also boost CTL responses and facilitate 
the elimination of infected cells (Fig. 2c), and that CTLs pre-stimulated 


382 | NATURE | VOL 517/115 JANUARY 2015 


") Unmutated CTL epitope jj Documented escape 


Medians and P values from Mann-Whitney test are shown. d, Characterization 
of CTL responses against HIV-1 Gag epitopes by interferon-y ELISpot. The 
peptides tested are listed below the x-axis (black type, epitopes in which 
sequence variation was detected; blue type, no variation). The observed 
mutation is underlined in red, and CTL escape (defined by the absence of 
positive response after mutation) is denoted by an asterisk above the bar. The 
peptide concentration was 10 ,g ml’. PBMCs, peripheral blood mononuclear 
cells; SFC, spot-forming cell. Error bars represent standard error of the mean 
(s.e.m.), n = 3. e, Sequences in Gag CTL epitopes for proviral DNA and 
outgrowth virus from resting CD4™ T cells in patient CP39. CTL epitopes with 
no observed variation are highlighted in blue. Epitopes with documented 
escape mutations are shaded in red. 


by Gag peptides generally had the highest activity. Together, these results 
demonstrate that chronically infected patients retain CTL clones that 
can recognize and eliminate autologous infected CD4” T cells, despite 
the presence of CTL escape mutations in dominant epitopes. However, 
these clones require stimulation with antigen for optimal activity. 

To characterize further which CTL population contributed to the 
elimination of cells infected by CTL escape variants, we compared the 
killing activity of two specific CTL populations: the population that targets 
epitopes in which escape has been identified and the one that targets 
unmutated epitopes (Fig. 3a). CD8* T cells from patients CP36 and 
CP39 were pre-stimulated with interleukin (IL)-2 and different synthetic 
peptides representing the wild-type forms of the relevant epitopes. After 
incubation for 6 days, each CTL population exhibited significant pro- 
liferation compared with no treatment or IL-2 alone (Fig. 3b, c). Pentamer 
staining for three available epitopes revealed that the number of epitope- 
specific CD8™ T cells increased dramatically after stimulation with wild- 
type peptides (Fig. 3b). After co-culture with autologous target cells 
infected with latent reservoir-derived viruses, CTLs targeting unmu- 
tated epitopes clearly showed stronger cytolytic activity than the IL-2 
only controls, while CTLs targeting epitopes with identified escaped 
mutations showed no significant killing (Fig. 3d). CTLs pre-stimulated 
by the Gag peptide mixture exhibited stronger killing than all single- 
peptide-stimulated populations (Fig. 3d). 
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Days post-infection 


To test whether CTLs that recognize unmutated viral epitopes can 
inhibit HIV-1 replication and clear infected cells in vivo, we generated 
patient-derived humanized mice using an improved version ofa recently 
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Figure 2 | CD8* T cells pre-stimulated with a mixture of Gag peptides 
eliminate autologous CD4* T cells infected with autologous HIV-1 from 
resting CD4* T cells. a, Pre-stimulated CD8* T cells (sCD8*) eliminate 
autologous infected CD4* T cells more efficiently than unstimulated CD8* 
T cells (uCD8"). Each symbol represents the mean of three replicates. Medians 
and P values from Mann-Whitney test are shown. b, sCD8° cells inhibit viral 
growth in autologous infected CD4* T cells with higher efficacy than uCD8* 
cells. p24, HIV-1 capsid (core) protein. c, sCD8* cells pre-stimulated by 
different viral peptides eliminate autologous CD4* T cells infected with viruses 
derived from resting CD4* T cells. b, c, Results were compared with cp4t only 
using paired t-tests. Error bars represent s.e.m., n = 3. *P < 0.05, **P < 0.01, 
***P < 0.001, NS, not significant (P > 0.05). 


reported mouse system named MISTRG”. Whereas the previously 
reported MISTRG mice bear a bacterial artificial chromosome (BAC) 
transgene encoding human SIRP-«, the newly generated MIS“? TRG 
mice harbour a knock-in replacement of the endogenous mouse Sirpa 
gene with a humanized version. With humanization by knock-in replace- 
ment of the Csf1, Csf2, 113, ie and Sirpa genes in the Rag2”'~ Il2rg”'~ 
genetic background, MIS“ TRG mice are highly permissive for human 
haematopoiesis and support the reconstitution of robust human lym- 
phoid and myelomonocytic systems. With the demonstrated development 
of functional T lymphocytes and monocytes/macrophages, MIS“ TRG 
mice provide a useful humanized mouse host for HIV-1 infection stud- 
ies. Bone marrow biopsies were obtained from study participants and 
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purified CD34* cells were used to reconstitute the MIS“ TRG mice. 
Weinfected these patient-derived humanized mice with primary HIV-1 
isolates grown from resting CD4* T cells from the same patient and 
then evaluated the antiviral effect of autologous CD8* T cells (Fig. 4a). 
MIS“ TRG mice engrafted with bone marrow CD34* cells from patient 
CP18 successfully developed human T-lymphocyte and monocyte/ 
macrophage subsets (Fig. 4b, c), which were sufficient to support HIV-1 
infection (Fig. 4d). Plasma HIV-1 RNA levels peaked 20-30 days after 
infection (Extended Data Fig. 6a). Depletion of CD4° T cells was 
clearly evident 12 days after infection in peripheral blood and spleen 
(Fig. 4e and Extended Data Fig. 6b, c). Cell-associated HIV-1 RNA was 
detected in both T cells and macrophages/monocytes (Extended Data 
Fig. 6d). Viral infection was also observed in various tissues in which a 
large number of memory CD4" T cells were detected (Extended Data 
Fig. 7). In control mice or mice that received autologous patient CD8* 
T cells pre-stimulated with a peptide representing the unmutated dom- 
inant SL9 epitope, levels of plasma HIV-1 RNA and proviral DNA in 
peripheral blood continued to increase from day 14 to day 29 after 
infection (Fig. 4f). In sharp contrast, mice that received CD8"* T cells 
pre-stimulated with unmutated epitopes (Gag mix or WF9) had a sig- 
nificantly lower level of viral replication (Fig. 4g, h). Dramatic decreases 
in plasma HIV-1 RNA of 100- to 1,000-fold were observed in all three 
mice that received CD8* T cells pre-stimulated with the mixture of 
Gag peptides including dominant and subdominant epitopes. Two of 
three mice had undetectable levels of plasma HIV-1 RNA and proviral 
DNA in peripheral blood measured at three time points (Fig. 4g). We 
performed the same experiments using patient CP36-derived human- 
ized mice and a reduction of peripheral HIV-1 RNA and DNA levels was 
also observed in mice that received CP36 CD8* T cells pre-stimulated 
with the mixture of Gag peptides (Extended Data Fig. 8). Since the post- 
engraftment lifespan of MIS®? TRG mice is only 10-12 weeks?®, we were 
only able to investigate the acute phase of HIV-1 infection and dem- 
onstrate the in vivo functionality of patient CD8* T cells. Future devel- 
opments of the MIS“ TRG model will prolong the post-engraftment 
lifespan of these mice and allow studies of the establishment and clear- 
ance of the HIV-1 latent reservoir in vivo. Together, our in vitro and in 
vivo experiments demonstrate that only CTL clones targeting unmu- 
tated epitopes are effective against cells infected with the viral variants 
that are likely to represent the major source of rebound HIV-1 after 
reversal of latency. 

The seeding of the HIV-1 latent reservoir starts just a few days after 
infection’®, before the development of a robust CTL response™. This is 
consistent with our finding that patients who initiated treatment early, 
in the acute infection stage, have few ifany CTL escape variants archived 
in the latent reservoir. However, if treatment was initiated in chronic 
infection, CTL escape variants became dominant in the latent reservoir, 
indicating a complete replacement of the initially established ‘wild-type’ 
reservoir. The mechanism behind this replacement warrants further 
investigation, but probably reflects the dynamic nature of the reservoir 
in untreated infection. In any event, the overwhelming presence of 
escape variants in the latent reservoir of chronic patients certainly pres- 
ents an additional barrier to eradication efforts. The striking difference 
between AP- and CP-treated patients presents another argument for 
early treatment of HIV-1 infection; early treatment not only reduces 
the size of the latent reservoir’, but also alters the composition of the 
reservoir, as shown here, ina way that may enhance the efficacy of poten- 
tial CTL-based eradication therapies. 

The hierarchy of HIV-1-specific CTL response in acute infection 
appears to have an important role in initial viral suppression, as dem- 
onstrated by the fact that certain immunodominant CTL populations 
are frequently linked to lower set point viraemia later in infection’”™. 
These immunodominant responses in acute infection have been iden- 
tified as the major selection force driving the development of CTL escape 
mutations'*”°. Here we show that these immunodominant response- 
driven mutations are not only archived in the latent reservoir, but also 
in fact dominate the latent provirus population in CP-treated patients. 
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Figure 4 | Broad-spectrum CTLs suppress in vivo replication of HIV-1 from 
the latent reservoir of the same patients in patient-derived humanized mice. 
a, Experimental design. dpi, days post-infection. b, c, Efficient engraftment 
of patient CP18-derived haematopoietic cells in MIS“ TRG mice at week 6. 
Representative flow cytometry analysis (b) and summary (c) of human CD45* 
cells, human T-lymphocyte and monocyte subsets. Eleven out of fifteen 
mice (enclosed in the rectangle) were used for HIV-1 infection. h, human; 
m, mouse. d, Correlation between frequency of peripheral human CD45* cells 
(6 weeks after engraftment) and plasma HIV-1 RNA levels (14 days after 
infection). e, Depletion of human CD4* T cells in peripheral blood after HIV-1 
infection. *P < 0.05, paired t-test, n = 11. f, g, Reduction of levels of plasma 
HIV-1 RNA and copies of peripheral blood HIV-1 DNA after injection of 
viral-specific CTLs. Filled symbols, above detection limit; open symbols, below 
detection limit. *P < 0.05, unpaired t-test. h, Effect of CTLs on the level of viral 
replication in vivo. The area under the curve (AUC) of the viraemia versus 
time plot for each mouse in f (n = 3) and g (n = 6) before (AUC1) or after 
(AUC2) injection of CD8* T cells was calculated to represent quantitatively 
viral replication over time. Mouse SL9-2 was excluded from AUC analysis 
owing to early death. Error bars represent s.e.m. *P < 0.05, unpaired t-test. 


Therefore, directing CTL responses to unmutated viral epitopes is essen- 
tial to clear latent HIV-1. Owing to bias in antigen presentation or 
recognition”, common vaccination strategies will probably re-stimulate 
immunodominant CTL clones that do not kill infected cells after the 
reversal of latency. Stimulation of CTL responses with viral peptides 
circumvents antigen processing and is able to elicit broad-spectrum 
CTL responses against unmutated regions of viral proteins. Our study 
suggests that latent HIV-1 can be eliminated in chronically infected 
patients despite the overwhelming presence of CTL escape variants. 
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Future directions in therapeutic vaccine design need to focus on boost- 
ing broad CTL responses, as also reported elsewhere”’, and/or manip- 
ulating immunodominance. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Human subjects. Peripheral blood or bone marrow for the isolation of primary 
CD4*, CD8* T cells or CD34" cells was obtained from 30 HIV-1-infected patients 
(Extended Data Table 1) and 7 healthy adult volunteers. All patients had been on 
ART for at least 2 years and had maintained undetectable plasma HIV-1 RNA levels 
(<50 copies per ml) for at least 1 year before study. Ten AP-treated patients were 
recruited from the OPTIONS cohort at the University of California San Francisco 
(UCSF). This study was approved by the Johns Hopkins Internal Review Board 
and by the UCSF Committee on Human Research. Written informed consent was 
provided by all study participants. HLA typing for each patient was performed by 
the Johns Hopkins University Immunogenetics Laboratory. 

Sample preparation for deep sequencing. Peripheral blood mononuculear cells 
(PBMCs) were isolated from whole blood by Ficoll gradient separation. CD4* T 
cells were purified from PBMCs by negative selection using CD4* Isolation Kit II 
(Miltenyi). Resting CD4* T cells were then purified from CD4* T cells by negative 
selection using CD25, CD69 and HLA-DR microbeads (Miltenyi). Genomic DNA 
was extracted from 5 million resting CD4” T cells from each patient using QIAamp 
DNA Mini Kit (Qiagen). The gag gene was amplified from genomic DNA bya two- 
round nested PCR using these primers: 5’ outer primer (5’-TTGACTAGCGGAG 
GCTAGAAGG-3’); 3’ outer primer (5’-GATAAAACCTCCAATTCCCCCTAT 
C-3'); 5’ inner primer (5’-GAGAGATGGGTGCGAGAGCGTC-3’); 3’ inner primer 
(5'-CITGCTCCTGTATCTAATAGAGC-3’). For each patient, the entire genomic 
DNA from 5 million resting CD4* T cells was evenly distributed as a template into 
80 PCR reactions. The reactions were performed by using High Fidelity Plantinum 
Taq Polymerase (Life Technologies) following the manufacturer’s instructions. 
PCR amplicons were purified by gel extraction after gel electrophoresis. 

Deep sequencing. For PacBio RS single-molecule sequencing, amplicons were bar- 
coded with a group of 10 bp indexes and then multiple samples were pooled together 
to generate a SMRTbell sequencing library following the Pacific Biosciences tem- 
plate preparation and sequencing-C2 user guide for 2 kb insert size and using the 
Pacific Biosciences DNA template preparation kit. For MiSeq sequencing, the pooled 
amplicon DNA was end repaired, adenylated, and ligated to Illumina TruSeq adap- 
tors and PCR enriched for 10 cycles. The resulting library was then run on a bio- 
analyser high-sensitivity DNA chip for size and concentration determination. The 
library was then sequenced on MiSeq for paired-end 250 bp reads. The sequence 
reads from PacBio and MiSeq were demultiplexed using Fastx-Toolkit. 

Data analysis for deep-sequencing results. For the paired MiSeq reads, the two 
reads were first merged using FLASH®". MiSeq and PacBio reads from each indi- 
vidual were then aligned to the reference HIV-1 consensus B Gag sequence using 
Bowtie2 (ref. 32). A custom program was written using Perl scripts to identify and 
compute the frequency of all sequence variants that caused non-synonymous amino 
acid changes in each individual’s relevant optimal Gag epitopes (based on reported 
information in the HIV Molecular Immunology Database, Los Alamos National 
Laboratory (http://www hiv.lanl.gov/content/immunology/index.html)) according 
to their HLA type. For each individual, variants that occurred at a frequency >3% 
were retained. Additionally, for PacBio reads, sequences with identified premature 
stop codons were eliminated from the analysed results. For each identified vari- 
ation, the mutation type regarding CTL recognition was determined by matching 
with the information in the before-mentioned database. The five mutation types 
adopted in this paper are: documented escape (no CTL response when patient cells 
are challenged with the variant peptide); inferred escape (variant is predicted to be 
an escape mutant by longitudinal study or transmission study, but the reactivity of 
the variant is not tested experimentally); diminished response (experimental data 
suggest partial escape as evidenced by decreased CTL response); susceptible form 
(CTL response is elicited when patient cells are challenged with the variant pep- 
tide); and mutation type not determined (no experimental data on CTL recog- 
nition of this variant). 

ELISpot assays. The ELISpot assays were performed using Human IFN-y ELISpot 
PLUS kit (Mabtech) according to methods previously described*’ and the manu- 
facturer’s instructions. PBMCs were added at 200,000 cells per well and synthetic 
peptides were added in a final concentration of 0.1, 1 or 10 4g ml~'. A response 
was considered positive if it was threefold higher than the mean background (cell 
only control) and greater than 55 SFC per million cells. The number of specific T 
cells was calculated by subtracting the mean background values. 

Recovery and sequencing of patient viruses from resting CD4* T cells. Co- 
culture assays were performed to recover and amplify replication-competent viruses 
as previously described**. The viruses were recovered from 5-10 million resting 
CD4* T cells. The concentration of outgrowth viruses was determined by p24 
ELISA (PerkinElmer). Total RNA of outgrowth viruses was extracted using TRIzol 
LS reagent (Life Technologies). Residual DNA was then removed by TURBO DNase 
(Life Technologies) treatment. First-strand complementary DNA was synthesized 
using SuperScript III Reverse Transcriptase (Life Technologies) and the gag gene 
was amplified from cDNA using the gag outer primer pair mentioned above. The 


PCR amplicons were then purified by gel extraction and sequenced by regular 
Sanger sequencing. 

In vitro HIV-1 infection. PBMCs from HIV-1-infected patients and healthy donors 
were stimulated by adding 0.5 igml” ' PHA and IL-2 (100 U ml’) to basal media 
(RPMI with 10% heat-inactivated fetal bovine serum and antibiotics) for 3 days 
before isolation of CD4* T cells. Each patient’s activated CD4* T cells were infected 
with viruses recovered from the same patient’s resting CD4* T cells. Healthy donors’ 
CD4* T cells were infected with a laboratory strain virus, BaL. The virus concen- 
tration used in infection was equivalent to the p24 concentration of 200 ng ml '. 
All infections were performed by centrifugation of target cells with virus at 1,200g 
for 2h. 

Stimulation of CD8* T cells. PBMCs from CP-treated patients were cultured in 
the presence of IL-2 (100 U ml) with a mixture of consensus B Gag (or Nef, Rev, 
Tat, Env) peptides (800 ng ml! for each) (NIH AIDS Reagent Program), or with 
individual synthetic peptide (0.5 jig ml’) (Genemed Synthesis). CD8* T cells were 
purified after 6 days of incubation by positive selection using human CD8 microbe- 
ads (Miltenyi). To monitor CTL proliferation, PBMCs were stained with CFSE (Life 
Technologies) before incubation and with the relevant pentamer (Proimmune) 
after incubation. PBMCs were then stained with CD8-APC (Becton Dickson (BD)) 
and analysed by flow cytometry using FACS Canto II (BD). 

Co-culture of autologous CD4* and CD8* T cells. Three hours after infection, 
CD4* T cells were mixed with autologous unstimulated or stimulated CD8* T 
cells at a 1:1 ratio in basal media at 5 million cells per ml. Two days after co-culture, 
enfuvirtide (T-20, Roche) was added into the culture at 10 1.M to prevent further 
infection events except if the measurement was p24 ELISA. Three days after co- 
culture, cells were stained with CD8-APC (BD) first, fixed and permeabilized with 
Cytoperm/Cytofix (BD pharmingen), then stained for intracellular p24 Gag (PE, 
Coulter). Cells were analysed by flow cytometry using FACS Canto II (BD). For 
measurement of viral growth, 5 1l of supernatant was taken from the co-culture at 
days 0, 3 and 6, and subjected to p24 ELISA. For analysis of cell contact dependence, 
CD4* and CD8* T cells were placed in separate chambers of trans-well plates 
(0.4 um, Costar). 

Generation and infection of patient-derived humanized mice. The previously 
reported MISTRG mouse in the Rag2‘~ Il2rg’~ 129 X Balb/c (N2) genetic back- 
ground harbours knock-in replacements of the endogenous mouse Csf1, Csf2, 113 
and Tpo genes with humanized versions and a BAC transgene encoding human 
SIRP-c°. We generated the Sirpa“” mouse, which harbours a knock-in replace- 
ment of the endogenous mouse Sirpa gene with a humanized version. The Sirpa“? 
mouse will be thoroughly described elsewhere (manuscript in preparation). The 
improved MIS““” TRG mouse was generated by breeding Sirpa"? mice to MITRG 
mice. All animal experimentations were performed in compliance with Yale Insti- 
tutional Animal Care and Use Committee protocols. MIS“ TRG mice were main- 
tained with continuous treatment with enrofloxacin in the drinking water (Baytril, 
0.27 mg ml *). Patient bone marrow or fetal liver CD34* cells were isolated by 
CD34 microbeads selection (miltenyi). Newborn mice (within their first 3 days of 
life) were sublethally irradiated (X-ray irradiation with X-RAD 320 irradiator, PXi; 
1 X 150 cGy) and 100,000 fetal liver or 250,000 patient CD34* cells in 20 pl of PBS 
were injected into the liver with a 22-gauge needle (Hamilton Company). Both 
male and female mice with comparable engraftment levels (percentage ofhCD45™, 
hCD3* and hCD14* cells in the blood) were separated randomly into the experi- 
mental groups 6-8 weeks after engraftment. Mice engrafted with patient CD34* 
cells were infected by retro-orbital injection with HIV-1 (100 ng p24), which was 
recovered and expanded from the resting CD4" T cells of the same patient (CD34~ 
cell donor), as mentioned earlier. Mice engrafted with fetal liver CD34” cells were 
infected by intravenous injection with HIV-1 BaL (100 ng p24). Mice with less than 
5% human CD45" cells in the peripheral blood were excluded from the infection 
study. Mice with more than 70% human CD45“ cells in the peripheral blood were 
also excluded because they were unhealthy due to human macrophage/monocyte- 
caused anaemia”®. Twenty million autologous CD8* T cells with or without pre- 
stimulation were injected intravenously 9 or 14 days after infection. Group allocation 
was blinded. Peripheral blood samples were collected by retro-orbital bleeding at 
different time points before and after injection of CD8* T cells. Engraftment of 
human CD45* cells as well as lymphoid and myeloid subsets was determined by 
flow cytometry. Plasma HIV-1 RNA in peripheral blood was measured by one-step 
reverse transcriptase (Invitrogen) real-time PCR using the following primers and 
probe, described previously*: forward (5’ — 3’) ACATCAAGCAGCCATGCAAAT, 
reverse (5’—> 3’) TCTGGCCTGGTGCAATAGG, and probe (5’ > 3’) VIC-CTA 
TCCCATTCTGCAGCTTCCTCATTGATG-TAMRA. Assay sensitivity is 200 RNA 
copies per ml of plasma. HIV-1 DNA in peripheral blood was also measured by real- 
time PCR using the same primers and probe mentioned earlier, with assay sens- 
itivity at 5 copies per 100 Ll of blood. Total viral DNA in PBMCs was determined 
by measuring copies of viral DNA per 100 ll blood and blood volume per mouse 
(80 pl blood per 1 g body weight). To quantitate total viral DNA in tissues, spleens, 
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livers and lungs of infected mice were collected. For the spleen, single-cell suspen- 
sions were treated with ACK lysis buffer. Liver and lung leukocytes were isolated by 
digesting tissues with 100 U ml * collagenase IV and 0.02 mg ml * DNase I (Sigma), 
followed by density gradient centrifugation. 

Statistical analysis. For comparison of HIV-1 variant frequency (Fig. 1b, c) and 
viral infection in HIV-1 BaL-infected mice (Extended Data Figs 6 and 7), we 
applied Mann-Whitney tests. For comparison of the inhibitory effect of autolo- 
gous CTLs (Fig. 2a), we applied a Wilcoxon matched pairs test. For comparison of 
viral replication in humanized mice (Fig. 4f-h), we applied an unpaired t-test. For 
all other comparisons, paired t-tests were applied. All tests were calculated by the 
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GraphPad Prism 6 software, and conducted as two-tailed tests with a type I error 
rate of 5%. No statistical method was used to predetermine sample size. 
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CTL epitopes in HIV-1 Gag 


B Documented Escape B Inferred Escape 


Extended Data Figure 1 | CTL escape variants dominate the latent reservoir 
of CP-treated HIV-1-positive individuals, but not AP-treated individuals. 
Frequency of variants in Gag CTL epitopes in proviruses from resting cp4* 
T cells. Results of all 25 patients tested are shown. Only optimal CTL epitopes 
relevant to each patient’s HLA type are listed in linear positional order on 
the x-axis. Results from both PacBio (left bar) and MiSeq (right bar) sequencing 
platforms are shown for each epitope. The absence of bars above a listed epitope 
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B Diminished Response 


B Susceptible Form 


8 Mutation Type Not Determined 


I 


indicates that only wild-type sequences were detected. For each mutation in a 
CTL epitope, information regarding the effect of the mutation on CTL 
recognition from the Los Alamos National Laboratory (LANL) HIV Molecular 
Immunology Database or from ELISpot assays described in Methods was used 
to assign the mutation to one of the categories indicated at the bottom. See 
Methods for definitions of these categories. 
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Extended Data Figure 2 | Characterization of CTL responses against HIV-1 Gag epitopes by interferon-y ELISpot. Results of seven patients tested are shown. 
The peptides tested are listed for each patient in each graph. Error bars represent s.e.m., n = 3. 
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Extended Data Figure 3 | Partial Gag sequences from proviral DNA and 
outgrowth virus from resting CD4* T cells from eight CP-treated patients. 
CTL epitopes with no observed variation are highlighted in blue. Documented 
escape mutations (red shading), inferred escape mutations (yellow shading), 


diminished response (pink shading), susceptible form (green shading) or 
undetermined variations (grey shading) in relevant optimal epitopes are 
indicated. See Methods for definitions of these types of mutations. This figure 
supplements Fig. le, as a total of nine CP-treated patients were tested. 
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Extended Data Figure 4 | CD8* T cells pre-stimulated with a mixture of 
consensus B Gag peptides eliminate autologous CD4* T cells infected with 
autologous HIV-1 from resting CD4* T cells. a, HIV-1 isolated from 
ART-treated individuals replicates as well as the laboratory strain virus BaL. 
p24 values represent mean of three replicates. Error bars represent s.e.m., = 3. 
b, CD8" T cells are not stimulated after co-culture with PHA-activated CD4* 
T cells. c, A representative flow cytometric analysis of CTL-mediated killing 


after co-culture of infected CD4* T cells with autologous CD8™ T cells. 

CTL activity is measured by the percentage of Gag-positive, CD8-negative cells 
after 3 days of co-culture relative to cultures without CD8* T cells. d, Pre- 
stimulated CD8* T cells eliminate autologous infected CD4* T cells more 
efficiently than non-stimulated CD8* T cells. All results were normalized to the 
CD4 only control group. Error bars represent s.e.m., n = 3. *P < 0.05, 

**P < 0.01, ***P < 0.001, NS, not significant (P > 0.05), paired t-test. 
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Extended Data Figure 6 | Viral dynamics and depletion of CD4* T cells in 
humanized mice. a, Viral dynamics in CP18-infected MIS®°TRG mice. 
CP18-derived MIS“ TRG mice were infected with autologous HIV-1. Plasma 
HIV-1 RNA levels were measured from day 0 to day 56. b, Depletion of CD4* 
T cells in peripheral blood of HIV-1 BaL-infected mice. MIS“ TRG mice 
engrafted with fetal liver CD34* cells were infected with HIV-1 BaL. The 
CD4 to CD8 ratio in peripheral blood was measured by fluorescence-activated 
cell sorting (FACS) from day 0 to day 29 after infection. Error bars represent 
s.e.m., n = 5. c¢, Depletion of CD4* T cells in spleen of HIV-1 BaL-infected 
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mice. MIS“ TRG mice engrafted with fetal liver CD34" cells were infected 
with HIV-1 BaL. The CD4 to CD8 ratio in spleen was measured by FACS 

20 days after infection. Medians and P values from Mann-Whitney test are 
shown. d, Detection of cell-associated HIV-1 RNA in T cells and pee ea 
monocytes. CD3* and CD14* human cells from HIV-1-infected MIS“? TRG 
mice from spleen and lung were purified by FACS. CD3° CD14 cells were 
also collected as controls. Cell-associated HIV-1 RNA was quantified by 
gag-specific quantitative polymerase chain reaction (qPCR). Error bars 
represent s.e.m., 1 = 3. *P < 0.05, unpaired t-test. 
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Extended Data Figure 7 | HIV-1 infection occurs in peripheral blood 

and tissues in humanized mice. a, Engraftment levels of MIS“) TRG mice 
with fetal liver or patient CD34" cells. b, Memory CD4* T cells are detected 
in MIS®TRG mice after infection. MIS®TRG mice were infected with 
HIV-1 BaL. Peripheral blood and indicated tissues from infected mice were 


collected at 20 days post-infection. Memory CD4* T cells were determined by 
CD45RO staining. c, Total number of cell-associated HIV-1 DNA in blood 
and tissues. DNA from peripheral blood or indicated tissues was isolated for 
the measurement of total amount of cell-associated HIV-1 DNA by real-time 
PCR. b, c, Medians and P values from Mann-Whitney test are shown. 
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generation of patient CP36-derived humanized mice is described in Fig.4. Mice _ infection. Plasma HIV-1 RNA (a) and HIV-1 DNA (b) in peripheral blood were 
were infected with autologous viruses at 6 weeks old. CD8* T cells from patient measured by real-time PCR. 
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Extended Data Table 1 | Characteristics of study subjects 


pean Bsa of CD4 count” oe “ — ‘ Treatment start time 
iagnosis (cells/ul) (eopiesiiai) (venrs) after infection (days) 
APO1 2006 1251 <50 7 64 
AP03 2004 595 <50 9 34 
AP04 1998 953 <50 15 77 
APO05 2002 618 <50 11 39 
APO7 2012 592 <50 1.5 67 
AP08 2008 780 <50 6 28 
AP09 2012 1069 <50 2 39 
AP10 2006 513 <50 7 50 
API1 2007 874 <50 6 10 
AP12 2007 629 <50 6 15 
CP05 2001 500 <50 10 >180 
CP12 1997 1074 <50 15 >180 
CP18 1998 773 <50 >4 >180 
CP19 2006 620 <50 6 >180 
CP26 1994 640 <50 16 >180 
CP27 1987 784 <50 4 >180 
CP28 1998 614 <50 6 >180 
CP31 2000 619 <50 10 >180 
CP32 1999 780 <50 2 >180 
CP35 2002 738 <50 10 >180 
CP36 2003 1119 <50 4 >180 
CP37 1999 730 <50 12 >180 
CP38 1986 870 <50 3 >180 
CP39 1996 1152 <50 10 >180 
CP40 2002 544 <50 7 >180 
CP42 1987 684 <50 16 >180 
CP47 1986 792 <50 14 >180 
CP48 1998 641 <50 8 >180 
CP49 1992 864 <50 5 >180 
CP50 2001 964 <50 4 >180 
Spree ANA ele ora patients road otine pnp chia ene intl 
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CEACAMI regulates TIM-3-mediated tolerance 


and exhaustion 


Yu-Hwa Huang’, Chen Zhu2*, Yasuyuki Kondo!*, Ana C. Anderson’, Amit Gandhi!, Andrew Russell’, Stephanie K. Dougan’, 
Britt-Sabina Petersen?, Espen Melum!*, Thomas Pertel’, Kiera L. Clayton’, Monika Raab®, Qiang Chen’, Nicole Beauchemin’”, 
Paul J. Yazaki!!, Michal Pyzik!, Mario A. Ostrowski’, Jonathan N. Glickman’, Christopher E. Rudd®, Hidde L. Ploegh’, 
Andre Franke”, Gregory A. Petsko*, Vijay K. Kuchroo” & Richard S. Blumberg! 


T-cell immunoglobulin domain and mucin domain-3 (TIM-3, also 
known as HAVCR2) is an activation-induced inhibitory molecule 
involved in tolerance and shown to induce T-cell exhaustion in chronic 
viral infection and cancers’ *. Under some conditions, TIM-3 expres- 
sion has also been shown to be stimulatory. Considering that TIM-3, 
like cytotoxic T lymphocyte antigen 4 (CTLA-4) and programmed 
death 1 (PD-1), is being targeted for cancer immunotherapy, it is 
important to identify the circumstances under which TIM-3 can inhibit 
and activate T-cell responses. Here we show that TIM-3 is co-expressed 
and forms a heterodimer with carcinoembryonic antigen cell adhe- 
sion molecule 1 (CEACAM1), another well-known molecule expressed 
on activated T cells and involved in T-cell inhibition® °. Biochemical, 
biophysical and X-ray crystallography studies show that the membrane- 
distal immunoglobulin-variable (IgV)-like amino-terminal domain 
of each is crucial to these interactions. The presence of CEACAM1 
endows TIM-3 with inhibitory function. CEACAM1 facilitates the 
maturation and cell surface expression of TIM-3 by forming a 
heterodimeric interaction in cis through the highly related membrane- 
distal N-terminal domains of each molecule. CEACAM1 and TIM-3 
also bind in trans through their N-terminal domains. Both cis and 
trans interactions between CEACAM1 and TIM-3 determine the 
tolerance-inducing function of TIM-3. In a mouse adoptive transfer 
colitis model, CEACAM1-deficient T cells are hyper-inflammatory 
with reduced cell surface expression of TIM-3 and regulatory cytokines, 
and this is restored by T-cell-specific CEACAM1 expression. During 
chronic viral infection and in a tumour environment, CELACAM1 and 
TIM-3 mark exhausted T cells. Co-blockade of CEACAM1 and TIM-3 
leads to enhancement of anti-tumour immune responses with improved 
elimination of tumours in mouse colorectal cancer models. Thus, 
CEACAM1 serves as a heterophilic ligand for TIM-3 that is required 
for its ability to mediate T-cell inhibition, and this interaction has a 
crucial role in regulating autoimmunity and anti-tumour immunity. 

We examined the role of CEACAM1 in ovalbumin (OVA)-specific 
peripheral T-cell tolerance’. OVA protein administration (Extended 
Data Fig. 1a) resulted in tolerance induction in wild-type OVA-specific 
T-cell receptor transgenic OT-II Rag2‘~ mice (Fig. 1a), but not in trans- 
genic OT-II Ceacam1~‘~ Rag2~‘~ mice (Fig. 1b). Transfer of carboxy- 
fluorescein diacetate succinimidyl ester (CFSE)-labelled naive cpD4* 
Va2" T cells from transgenic OT-II Rag2~'~ mice into Ceacam1~/~ 
recipients (Extended Data Fig. 1b, c) was associated with increased OVA 
induced proliferation (Fig. 1c) and TIM-3 expression uniformly restricted 
to proliferating CEACAMI* T cells (Fig. 1d and Extended Data Fig. 1d). 


CD4° T-cell receptor (TCR) VB8* T cells in Ceacam1~‘~, but not 
wild-type, mice lacked TIM-3 expression after Staphylococcus aureus 
enterotoxin B (SEB) administration, suggesting CEACAM] and TIM-3 
co-expression on tolerized T cells (Extended Data Fig. le, f). Flag-tagged 
human (h) CEACAM1 enhanced cell surface expression of co-transfected 
haemagglutinin (HA)-tagged hTIM-3 in human embryonic kidney 293T 
(HEK293T) cells, with virtually all hTIM-3-positive HEK293T cells nota- 
bly CEACAM1-positive (Fig. le). Human T cells co-expressed TIM-3 and 
CEACAM1 after in vitro activation with decreased CEACAM1 expres- 
sion after TIM3 (also known as HA VCR2) silencing (Fig. 1fand Extended 
Data Fig. 1g, h). Human immunodeficiency virus (HIV)-infected, but not 
uninfected, subjects exhibited increased CEACAM1 * TIM-3> (double- 
positive) CD4* T cells, which were poor producers of interferon-y (IFN- 
), as were double-positive CD8* T cells (Fig. 1g, h and Extended Data 
Fig. 1i-l). In situ proximity ligation analysis’? of hCEACAM1 and hTIM-3 
co-transfected HEK293T cells (Fig. liand Extended Data Fig. lm-o), and 
co-cultures of activated primary human T cells (Extended Data Fig. 1p, q) 
confirmed the nearness of both molecules on the cell surface of HEK293T 
cells and co-localization within the immune synapse of activated T cells, 
respectively. 

TIM-3 has been proposed to engage an unknown ligand’? (Extended 
Data Fig. 2a—c), and we considered CEACAM 1 a possible candidate that 
is known to homodimerize'*. Modelling available X-ray crystallographic 
structures of mouse (m) CEACAM1] (ref. 14) and mTIM-3 (ref. 13) 
membrane-distal IgV-like, N-terminal domains predicted structural 
similarity with extensive interactions along their FG-CC’ interface in 
cis and trans configurations (Extended Data Fig. 2d-g and Supplemen- 
tary Information). Mouse T-cell lymphoma cells predicted to possess a 
novel TIM-3 ligand expressed CEACAM1 (refs 13, 15) (Extended Data 
Fig. 2h, i). hCEACAM1, but not integrin «5 (ITGA5) (Extended Data 
Fig. 3a), was co-immunoprecipitated with hTIM-3 and vice-versa from 
co-transfected HEK293T cells (Fig. 2a, b). Co-immunoprecipitation of 
CEACAM1 and TIM-3 was confirmed with activated primary human 
T cells (Extended Data Fig. 3b) and primary mouse T cells from either 
Ceacam1-4L"® Ceacam1 '~ mice’ or Ceacam1-48"* Ceacam1 ‘~ mice’° 
(transgenic mice in which CEACAM1 isoforms containing a long (L) or 
short (S) cytoplasmic tail, respectively, are conditionally overexpressed 
in T cells)’ (Extended Data Fig. 3c). 

Although tunicamycin treatment had no effect (Extended Data Fig. 3d), 
mutation of amino acid residues including natural human allelic vari- 
ants anticipated to be involved in these interactions disrupted the co- 
immunoprecipitation of hCEACAM1 and hTIM-3 in co-transfected 
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Figure 1 | TIM-3 and CEACAM1 are co-expressed on T cells during 
induction of tolerance. a, b, Tolerance induction in indicated mice. Median 
c.p.m., counts per minute. c, d, Responses of CFSE-labelled transgenic OT-II 
Rag2 '~ T cells in mesenteric lymph nodes (MLN), peripheral lymph node 
(LN) or spleen of wild-type (WT) or Ceacam1 ‘~ recipients to PBS (n = 3 per 
group) or OVA (n = 5 per group) for proliferation (c) and CEACAM1 or 
TIM-3 (d) expression. ND, not detectable. e, hCEACAM1 and hTIM-3 
expression in co-transfected HEK293T cells. Percentage and mean fluorescence 
intensity (MFI) of hTIM-3 indicated. BFA, brefeldin A; ER, endoplasmic 


HEK293T cells (Fig. 2c, d, Extended Data Fig. 3e-s, Extended Data Table 1 
and Supplementary Information). 

A single-chain of the hCEACAM1 N-terminal domain (amino acids 
1-107) joined to the hTIM-3 N-terminal domain (amino acids 1-105) 
by a linker (GGGGS), with a hexahistidine tag appended to the car- 
boxy terminus was expressed in Escherichia coli (Extended Data Fig. 4a), 
and shown to interact specifically with the N-terminal domain of hTIM-3 
by surface plasmon resonance (Extended Data Fig. 4b-e). This single- 
chain protein was crystallized and a structural model built from X-ray 
diffraction data (Extended Data Table 2). This revealed two similar 
copies of ahCEACAM1 (IgV domain)-hTIM-3 (IgV domain) hetero- 
dimer interacting along each of the respective FG—CC’ faces (Fig. 2e) 
and at amino acid contact points demonstrated to be involved in bio- 
chemical interactions between hCEACAMI and hTIM-3 (Fig. 2f, g 
and Extended Data Fig. 4f—h), with no conformational changes of the 
hCEACAM1 IgV domain upon forming a heterodimer with hTIM-3 
in comparison to a CEACAM1 homodimer (Extended Data Table 2, 
Extended Data Fig. 4i-k and Supplementary Information). 

Metabolic labelling (Fig. 2h-j) showed overall enhancement of hTIM-3 
biosynthesis after hCEACAM1 co-expression with decreased cell surface 
display, intracellular accumulation and hypoglycosylation of hTIM-3 
when mutant, hypomorphic forms of hTIM-3 or hCEACAMI were co- 
expressed in HEK293T cells (Extended Data Fig. 5a—e). Especially pro- 
found was a natural hTIM-3 variant (rs147827860 (Thr101Ie)) that 
although exhibiting normal core protein association with hCEACAM1 
(Extended Data Fig. 5f) could not be rescued by hCEACAM1 co-expression 
and remained hypoglycosylated with increased intracellular retention 
anda near absence of cell surface expression (Fig. 2h, i, Extended Data 
Fig. 5a, b, e and Supplementary Information). 

CD4* T cells from SEB-treated Ceacam1‘~ mice or mice with trans- 
genic overexpression of Tim3 (also known as Haver2) in T cells’® lack- 
ing CEACAM 1 expression (Tim3 T8 Ceacam1 ‘~) exhibited diminished 
expression of all mTIM-3 forms (fully glycosylated, hypoglycosylated 
and core protein) (Fig. 2k), decreased upregulation of TIM-3 expression 
on the cell surface (Fig. 21) and blunted deletion of CD4* V8" T cells 
(Extended Data Fig. 5g) relative to wild-type or transgenic Tim3"® mice 
expressing endogenous CEACAM1. T-helper 1 (T};1) polarized cells from 
Tim3"® Ceacam1 ‘~ mice were resistant to galectin-9-induced apo- 
ptosis (Extended Data Fig. 5h), which is dependent on TIM-3 expres- 
sion as a glycoprotein’. 


reticulum. f, KCEACAMI and hTIM-3 expression on activated primary CD4* 
human T cells. g, h, CEACAMI* TIM-3* CD4* T cells (g) and intracellular 
cytokine staining for IFN-y in CD4* T cells after SEB stimulation (h) in 
HIV infection. C, CEACAM1; T, TIM-3 (n = 4 per group). i, In situ proximity 
ligation assay of hCEACAM1 and hTIM-3 co-transfected HEK293T as in e. 
DAPI, 4’,6-diamidino-2-phenylindole. All data are mean + s.e.m. and 
represent five (e, f), three (c, d, i) and two (a, b) independent experiments. 
*P< 0.05; **P< 0.01; ***P<0.001. 


The hTIM-3 N-terminal domain bound hCEACAM1 and stained the 
cell surface of human CEACAM1-4L-transfected Jurkat T cells (Fig. 2m 
and Extended Data Fig. 5i, j). Trans-ligation of wild-type mouse CD4* 
T cells with amCEACAM1 N-terminal domain-Fc fusion protein (NFc) 
rendered them unresponsive to in vitro stimulation with anti-CD3 and 
anti-CD28 (Fig. 2n and Extended Data Fig. 5k). CD4" T cells from SEB- 
treated Tim3'® or Tim3 Ceacam1 ‘~ mice exhibited similar levels of 
TIM-3 associated BAT3, a repressor of TIM-3 cytoplasmic tail signalling”, 
at the time of T-cell isolation (Fig. 20). However, only T cells from SEB- 
tolerized Tim3"% mice, but not SEB-treated Tim3"% Ceacam1~'~ mice 
that were resistant to SEB-induced tolerance, exhibited BAT3 dissocia- 
tion from mTIM-3 after NFc fusion protein ligation with further release 
observed after addition of anti-CD3 (Fig. 20 and Extended Data Fig. 51, m). 
We observed similar levels of NFc fusion protein staining of primary T 
cells from Tim378 and Tim3"8 Ceacam1~'~ mice, relative to that observed 
with Ceacam1~'~ mice, which was blocked by a TIM-3-specific antibody 
(Extended Data Fig. 5n), implicating heterophilic ligation of TIM-3 as 
the major factor responsible for T-cell inhibition and BATS release. 

Highly activated CD4* T cells from wild-type and Ceacam1™'~ mice 
were transduced with a retrovirus encoding green fluorescent protein 
(GFP) plus mTIM-3. Anti-CD3 induced proliferation and TNF-o secre- 
tion was inhibited by NFc fusion protein ligation in transduced wild- 
type but not Ceacam1~'~ T cells or all groups transduced with GFP alone 
(Fig. 2p, qand Extended Data Fig. 50, p). Both wild-type and Ceacam17'~ 
T-cell transductants with a mTIM-3 cytoplasmic tail deletion construct 
(TIM-34252-781)17.18 were unresponsive to anti-CD3 stimulation or NFc 
fusion protein ligation (Fig. 2p, q and Extended Data Fig. 50, p). 

Tolerance pathways are important in T-cell regulation’’. We detected 
homozygous rs147827860 (Thr101 Ile) carriage in inflammatory bowel 
disease (IBD) but not controls (Extended Data Table 3a—c). We thus 
transferred naive CD4* CD62L™" CD44 T cells from wild-type mice 
into Ceacam1~'~ Rag2-'~ recipients and observed colon-infiltrating 
CEACAM1* TIM-3” T cells expressing markedly decreased intracel- 
lular levels of IFN-y, IL-2 and IL-17A relative to other lamina propria 
T cells (Fig. 3a, b). Colon-infiltrating lamina propria T cells from adoptive 
transfer of naive Ceacam1 /~ CD4* T cells into Ceacam1™'~ Rag2-/~ 
recipients expressed reduced cell surface levels of TIM-3 (Fig. 3c) and 
increased intracellular levels of TNF-« (Fig. 3d) in association with a 
severe progressive colitis with increased mortality (Fig. 3e, dagger sym- 
bol), and histopathological evidence of injury (Fig. 3f) characterized by 
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Figure 2 |CEACAMI and TIM-3 heterodimerize and serve as heterophilic 
ligands. a, b, Co-immunoprecipitation (IP) and immunoblot (IB) of wild-type 
hCEACAM1 and hTIM-3 in co-transfected HEK293T cells. c, d, Co- 
immunoprecipitation and immunoblot of wild-type hCEACAM1] and 
hTIM-3 mutants (c) or wild-type hTIM-3 and hCEACAM1 mutants (d) 

as in a and b. e, Human CEACAM1 (IgV)-TIM-3 (IgV) heterodimer 
structure. f, g, 2F, — F. maps contoured at 0.90 showing electron 

densities. h, i, Autoradiogram of anti-haemagglutinin (HA) (hTIM-3) 
immunoprecipitate from metabolic-labelled (h) and pulse-chase metabolic- 
labelled (i) co-transfected HEK293T cells. CHO, carbohydrate; core T, 
non-glycosylated hTIM-3; Cw, wild-type hCEACAM1; EndoH, 
endoglycosidaseH; H2-MA, HA-tagged influenza virus A M2 protein; 

T, hTIM-3(Thr1011e); Tw, wild-type hTIM-3. hTIM-3 isoforms noted. 

j, Quantification of densities in i (n = 3 per group). k, Immunoblot for mTIM-3 


acute and chronic inflammation (Extended Data Fig. 6a). Reestablish- 
ment of T-cell-specific mCEACAM1-4L expression using Ceacam1-4L™ 
Ceacam1~'~ mice®, restored TIM-3 display and decreased intracellular 
TNF-« expression by the infiltrating lamina propria T cells together 
with decreased disease severity (Fig. 3c-f and Extended Data Fig. 6b), 
reflective of CEACAM1* TIM-3~ T-cell restoration. 

CD4* T cells from Tim3"’ Ceacam1 ‘~ mice, in contrast to those 
from CEACAM1-proficient Tim3'® mice, were hyper-inflammatory 
after transfer into Ceacam1™'~ Rag2'~ recipients and caused significant 
weight loss and severe colitis with increased expression of T};17 signature 
genes (Fig. 3g—-i and Extended Data Fig. 6c, d). Nanostring (Extended 
Data Fig. 6e) and quantitative PCR (Fig. 3j) analysis of lamina propria 
mononuclear cells demonstrated increased transcripts for Ebi3, IL-27p28 
(also known as 127) and IL-12p35 (also known as I/12a) encoding the 
regulatory cytokines IL-27 and IL-35 (refs 20, 21) in Ceacam1_'~ Rag2'~ 
recipients of CD4™ T cells from Tim3"* relative to wild-type mice, and 
their absence in recipients of Ceacam1 '~ or Tim3"® Ceacam1'~ T cells. 

Anti-tumour immunity is hindered by T-cell expression of inhibitory 
molecules associated with an exhausted phenotype”. We observed high 
levels of CEACAM1* TIM-3", relative to PD-1* TIM-3*, CD4* and 
CD8* T cells at the tumour site of wild-type mice in the azoxymethane 
(AOM)/dextran sodium sulphate (DSS) model of colitis-associated colon 
cancer (Extended Data Fig. 6f-j). At 2.5% DSS, Tim3"* mice exhibited 
100% fatality, significantly greater than all other genotypes (Fig. 4a). 
Tumour quantification at a lower DSS dose (1.5%) showed that the polyp 
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from PBS-treated (—) or SEB-treated (+) CD4* T cells. Labelling as in h and 
i. 1, mTIM-3 expression after SEB tolerance induction. m, Column-bound 
glutathione S-transferase (GST)-hTIM-3 IgV-domain pull-down of 
hCEACAM1 detected by immunoblot. GST’, GST-hTIM-3 dimer. Ft, flow 
through. n, Suppression of mouse CD4* T-cell proliferation by mCEACAMI1 
N-terminal domain-Fc fusion protein (NFc). 0, Immunoprecipitation of 
mTIM-3 and immunoblot for BAT3 or mTIM-3 from lysates of CD4* T cells. 
p, q, Proliferation of CD4* T cells from wild-type (p) and Ceacam1-/~ 

(q) mice transduced with wild-type mTIM-3 (Tw), mTIM-34?°?-?*! (Tmut) 
or vector exposed to anti-CD3 and either NFc or IgG1-Fc (IgG1). Data are 
mean = s.e.m. and represent five (a, b), four (c, d), three (h-j, 1, n, p, q) and two 
(k, m, o) independent experiments. NS, not significant; *P < 0.05; **P < 0.01; 
*EED < 0.001. 


numbers, polyp size, high-grade dysplasia and frank carcinoma were 
significantly higher in Tim3"* than in wild-type mice with decreased neo- 
plasia observed in Tim3"® Ceacam1 ~~ and Ceacam1~'~ mice (Fig. 4b-d 
and Extended Data Fig. 6k). 

After implantation of colorectal cancer cells (CT26) subcutaneously 
in BALB/c mice, a significant fraction of CD8* tumour infiltrating lym- 
phocytes (TILs) was triple-positive T cells (TIM-3 * pp-1* CEACAM1°), 
with the relative cell surface levels of TIM-3 correlating with CEACAM1 
(Fig. 4e). PD-1* TIM-3>"8+t CEACAMI" T cells were characterized 
by extremely low intracellular IL-2 and TNF-« expression consistent 
with exhaustion (Fig. 4f). Co-administration of an anti-mCEACAM1 
and anti-mTIM-3 antibody delayed subcutaneous tumour growth ina 
preventative model (Fig. 4g and Extended Data Fig. 7a), and was able 
to exceed protection with co-blockade of PD-1 and TIM-3 (Extended 
Data Fig. 7b, c). Ina therapeutic model, blockade of CEACAM1 synergized 
with PD-1 inhibition (Extended Data Fig. 7d, e). CEACAM1]1 and TIM-3 
co-blockade was associated with increased CD8* (Fig. 4h) and cpD4* 
(Fig. 4i) TILs with enhanced IFN-y production (Fig. 4j) and decreased 
IL-10 production (Extended Data Fig. 7f) by the infiltrating CD8* and 
CD4* T cells, respectively. Increased tumour antigen-specific CD8* 
T cells defined by AH1-tetramer staining”, relative to the total CD8* 
T-cell population (Fig. 4k and Extended Data Fig. 7g) was observed in 
the draining lymph nodes that correlated with tumour growth inhibi- 
tion (Extended Data Fig. 7h). CT26 tumour growth was impeded in 
Ceacam1~/~ mice (Fig. 41) in association with increased AH-1 tetramer~ 
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Figure 3 | TIM-3 regulation of mucosa-associated inflammation requires 
CEACAM1. a, mCEACAM1 and mTIM-3 expression on colonic lamina 
propria CD4* T cells. b, Intracellular cytokines in cells described in a. 

c, d, mTIM-3 (c) and intracellular TNF-« (d) expression in lamina propria 
CD4* T cells from indicated donors. e, Body weights relative to weights on 
day 14 of groups in c and d. Five mice expired (7). f, Score of surviving mice 


CD8* T cells in draining lymph nodes (Fig. 4m and Extended Data 
Fig. 7i) and decreased TIM-3 expression on tumour-associated TILs 
(Fig. 4n). 

In conclusion, we show that CEACAM] and TIM-3 form a new het- 
erodimeric complex that is determined by interactions between their 
structurally similar membrane-distal IgV-like, N-terminal domains. This 


Oo 
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of groups in e. g, Body weights of genotypes as in e. h, Score of groups 
described in g. i, j, Nanostring (i) and quantitative PCR (j) of lamina propria 
mononuclear cells. Actb, B-actin gene. All data are mean + s.e.m. and represent 
six (a), four (b) and three (c-j) independent experiments. *P < 0.05; 

**P < 0.01; ***P < 0.001. 


association is evident during early biosynthesis and crucial for proper 
maturation, cell surface display and function of TIM-3 and probably 
CEACAM1 when expressed together. CEACAM1 also functions as an 
essential trans-heterophilic ligand for TIM-3 with the tolerance-inducing 
functions of TIM-3 requiring interactions with CEACAM1 in both cis 
and trans configurations. CEACAM] expression with TIM-3 is further 
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Figure 4 | CEACAMI determines TIM-3 regulation of anti-tumour 
immune responses. a, Survival curves in AOM/2.5% DSS model. 

b-d, Assessment of polyp numbers (b), polyp size (c) and cancer grades (d) in 
AOM/1.5% DSS model. e, Staining of CD8* T cells associated with CT26 
tumours. f, Intracellular cytokine expression in TIL subsets after anti-CD3 
stimulation. g, Prevention of CT26 tumour growth in wild-type mice (n = 5 per 
group). mAb, monoclonal antibody. h-k, Analysis of TILs for relative 


Time (days) 


proportion of CD8* (h) and CD4* (i) T cells, IFN-y* CD8* T cells (j) and 
tumour-specific (AH1-tetramer, tet*) CD8* T cells in draining lymph nodes 
(dLN) (k) in groups described in g. I-n, Growth of CT26 cells (I), AH1 tet* 
CD8* T cells in dLN (m) and TIM-3 expression on TILs (n) in wild-type and 
Ceacam1~'~ mice. Data are mean + s.e.m. and represent four (e), three (g-k) 
and two (a-d, f, |-n) independent experiments. *P < 0.05; **P < 0.01; 
***D <(0,001. 

15 JANUARY 2015 | VOL 517 | 


NATURE | 389 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


characteristic of T cells with an exhausted phenotype such that in the 
absence of these interactions inflammatory responses are unrestrained 
and anti-tumour immunity enhanced. Together, these studies describe 
a novel class of heterodimeric protein interactions that function in tol- 
erance induction with broad implications for many types of infectious, 
autoimmune and neoplastic conditions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. C57BL/6 and BALB/c mice were purchased from Jackson Laboratory. 
Ceacam1~'~ mice (C57BL/6 and BALB/c backcrosses) were generated by N. 
Beauchemin”. Transgenic OT-II Ceacam1 '~ Rag2~'~ mice were generated by 
first crossing Ceacam1~'~ mice to Rag2'~ (Taconic) for >6 generations to obtain 
Ceacam1 ’~ Rag2’~ double-knockout mice on a C57BL/6 background, followed 
by crossing these mice to transgenic OT-II Rag2 ’~ mice (Taconic) for >9 gener- 
ations. Tim3"® (ref. 16) and Ceacam1~/~ mice ona C57BL/6 background were inter- 
crossed to generate Tim3"® Ceacam1 ’~ mice. Ceacam1-4L'® Ceacam1 ‘~ and 
Ceacam1-4S"8 Ceacam1 /~ mice have been previously described*"”. All studies 
were performed in mice with a C57BL/6 genetic background except for CT26 tumour 
experiments that were in a BALB/c genetic background. Animal studies were con- 
ducted in a gender and age-matched manner for all experiments. Both male and 
female mice were used and were 6-8 weeks of age at the time of experiments or 
6-10 weeks for the adoptive transfer colitis experiments. The number of animals 
used per group was based on previous experimental results and observed variability. 
Histopathology analysis was performed in a blinded manner by an expert patho- 
logist (J.N.G.). For all other in vitro and in vivo analyses, investigators were not 
blinded to treatment allocation. All experiments were approved and conducted 
according to the guidelines set forth by the Harvard Medical Area Standing Com- 
mittee on Animals. 

Genotyping. Mouse tails were digested in tail lysis buffer (100 mM Tris-HCl, pH 8.5, 
5 mM EDTA, 0.2% SDS, 200 mM NaCland proteinase K (Roche)) overnight at 55 °C. 
Genomic DNA was phenol-extracted and isopropanol precipitated. DNA was dis- 
solved in TE buffer. The primer sequences for genotyping are available on request. 
T-cell tolerance animal models. For SEB tolerance induction model, animals of 
the indicated genotypes were injected intraperitoneally with 25 ig of SEB (Millipore) 
on days 0 and 4, and euthanized at day 8 after the first injection. Peripheral lymph 
node cells were isolated and stained for T-cell receptor VB6, VB8 and CD4, and 
analysed for TIM-3 and CEACAM1 expression on CD4* VB8° cells. Cells were 
also re-stimulated in vitro at 0.5 X 10° cells per ml with anti-CD3 (0.1-10.0 1g ml) 
and assessed for proliferation by [*H]-thymidine (1 Ci ml” ') uptake or produc- 
tion of IL-2 by ELISA. For OVA antigen-specific T-cell tolerance induction, high- 
dose OVA antigen was used and modified according to previous studies'’”*”*. In 
brief, transgenic OT-II Rag2'~ or Ceacam1'~ Rag?’ mice were injected intra- 
peritonally with 500 pg of OVA peptide (323-339, 323-ISQA VHAAHAEINEAGR- 
329, AnaSpec) in an equal volume amount of complete Freund’s adjuvant (CFA) or 
PBS alone at day 0 (activation of immune response) and at day 4 (tolerance induction). 
At day 10, CD4* T cells were isolated from spleens and pooled peripheral lym- 
phocytes from each experimentally treated mouse as indicated above. The pooled 
lymphocytes from each mouse were re-stimulated in vitro with various concentra- 
tions of OVA33-339 peptide (0.1, 1, 2.5 and 10 yg ml | '). The 96-well cultures were 
set up with a total cell number of 2.5 X 10*in 200 pl RPMI complete medium pulsed 
with 1 Ci [PH] -thymidine (PerkinElmer Life Sciences) for the final 8 h of the 72 h 
assay, and collected with a Packard Micromate cell harvester. Counts per minute 
(c.p.m.) were determined using a Packard Matrix 96 direct counter (Packard Bio- 
sciences) and data expressed as median values. For in vivo tracking of OVA-specific 
antigen-specific T-cell responses, adoptive transfer experiments were performed 
on the basis of previous studies”’. In brief, CD4* Va2™ T cells were isolated from 
spleens and pooled peripheral lymphocytes of transgenic OT-II Ceacam1 *!* (wild- 
type) Rag2’ ~ mice and the cells were pre-incubated for 10 min with 4 4M CFSE 
in PBS plus 0.1% FBS. CFSE-labelled cells (5 X 10°) were injected intravenously by 
tail vein into syngeneic C57BL/6 wild-type or Ceacam1 ~~ recipients. After 18h 
to allow the adoptively transferred T cells to establish themselves in vivo, and which 
showed no differences in parking of the T cells among the various genotypes, the 
mice were then immunized with OVA3 3-339 (50 1g) in an equal volume of CFA. 
On day 3, cells were collected from spleen, mesenteric lymph and peripheral lymph 
node of individual mice and stained with DAPI to exclude dead cells, and con- 
currently stained with anti-CD4—PerCp, anti-CEACAM1- allophycocyanin and 
anti-TIM-3-phycoerythrin. Total lymphocytes were gated and the CFSE subset 
representing the transgenic OT-II, adoptively transferred CD4* T cells analysed 
and CEACAM1- and TIM-3-expression assessed on the CFSE-positive cells. 
Cell lines. 3T3 mouse fibroblast cells, the TK-1 mouse T cell lymphoma cell line, the 
HEK293T cell line, and CT26 mouse colorectal cancer cell lines were all purchased 
from ATCC as mycoplasma-free cell lines and culture conditions were all followed 
according to ATCC instructions. Human CEACAM1-4L transfected human Jurkat 
T-cell line and human CEACAM1-4L transfected HeLa cells have been previously 
described (provided by J. Shively and S. G. Owen)”**”’. 

Antibodies and reagents. An anti-CEACAM1 monoclonal antibody that binds to 
the C-C’ loop of mCEACAM1 (ref. 30) (ccl, provided by K. V. Holmes and J. Shively) 
and isotype control (mouse IgG1) antibodies for the in vivo injection experiments 
were purchased from Bio X Cell. An Fc fusion protein consisting of the N-terminal 
domain of mouse CEACAM1 together with human IgG1, N-CEACAM1-Fc”! was 
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provided by T. Gallagher. The following fluorochrome-conjugated antibodies were 
purchased from BioLegend: anti-IFN-y (XMG1.2), anti-IL-2 (JES6-5H4), anti-IL- 
17A (TC11-18H10.1), anti-IL-10 (JES5-16E3) and PD-L1 (10F.9G2); and other 
antibodies were purchased from eBioscience: anti-CD3 (clone, 145-2C11), anti- 
CD28 (clone 37.51), anti-CD8 (clone, 53-6.7), anti-CD4 (clone, RM4-5), anti- 
TNF-a (clone, MP6-XT22) and anti-TIM-3 (clone, 8B.2C12). The BAT3 antibody 
(clone AB10517) was purchased from Millipore. Brefeldin A and monensin were 
from eBioscience. The anti-TIM-3 5D 12, 2C12, 2E2, 2E12 and 3F9 antibodies were 
generated in V.K.K.’s laboratory and 5D12 was conjugated to phycoerythrin and 
allophycocyanin by BioLegend. The anti-human CEACAM1 antibodies specific 
for the N-terminal domain (34B1, 26H7 and 5F4) have been previously described 
and characterized in R.S.B.’s laboratory**”’. All the antibodies used for biochemical 
experiments if otherwise indicated, were purchased from Cell Signaling. Tunicamycin 
was purchased from Sigma. 

Primary human T-cell isolation. Human peripheral blood mononuclear cells 
(PBMCs) were isolated from buffy coats from healthy anonymous donors accord- 
ing to an Institutional Review Board approved protocol from Brigham and Women’s 
Hospital. CD4* T cells were purified from PBMC using CD4 Microbeads (Miltenyi) 
and cultured in RPMI-1640 supplemented with 10% FBS (Invitrogen), L-glutamine, 
non-essential amino acids, sodium pyruvate, 20 mM HEPES and 100 IU ml ! recom- 
binant human IL-2 (R&D Systems). The T cells were stimulated with plate-bound 
anti-CD3 (OKT3) and anti-CD28 (CD28.2) antibodies (1 jig ml | each, eBioscience). 
Cellular based studies on HIV-infected individuals. In the analysis of HIV-infected 
individuals, informed consent was obtained in accordance with the guidelines for 
conduction of clinical research at the University of Toronto and Maple Leaf Clinic 
institutional ethics boards. Written informed consent was provided for this study, 
which was reviewed by research ethics board of the University of Toronto, Canada 
and of St. Michael’s Hospital, Toronto, Canada. Individuals were recruited from 
a Toronto-based cohort (Maple Leaf Clinic and St. Michael’s Hospital, Toronto, 
Canada). Samples were obtained from HIV-positive chronically infected antiretro- 
viral treatment naive individuals (infected >1 year, with detectable viral load) and 
demographically matched HIV-seronegative individuals. Whole blood was collected 
in anti-coagulant treated tubes and PBMCs were isolated using Ficoll-Paque PLUS 
(GE Healthcare Bio-Sciences) and stored at — 150 °C until future use. PBMCs from 
healthy HIV-1-uninfected and HIV-1-infected individuals were stained with CEACAM1 
monoclonal antibody (clone 26H7), detected with a fluorophore-conjugated sec- 
ondary antibody, followed by fluorophore-conjugated monoclonal antibodies to 
CD4, CD8 and CD3 (BioLegend), and TIM-3 (R & D Systems) to determine phe- 
notype assessment. An Aqua amine dye (Invitrogen) was used to discriminate live 
and dead cells. For functional experiments, cells were stimulated after thawing with 
1 pg ml of overlapping HIV-1 Clade B Gag peptide pool (National Institutes of 
Health AIDS Reagent Program), or 1 jg ml‘ SEB (Sigma Aldrich) in the presence 
of brefeldin A (Sigma) for 6 h. Surface staining was followed by a fixation and per- 
meabilization step. Intracellular staining for cytokines was performed using anti- 
IFN-y (Biolegend). Cells were fixed in 2% formalin/PBS and acquired with a modified 
LSRII system (BD Biosciences). A total of 100,000 events were collected and analysed 
with FlowJo software (Tree Star). 

In situ proximity ligation assay. Proximity ligation assay was performed using Duo- 
link in situ PLA reagents on HEK293T cells co-transfected with haman CEACAM1- 
Flag-tagged and human TIM-3-HA-tagged vectors. Primary antibodies used were 
anti-Flag or anti-HA antibodies and followed by secondary antibodies. Cells on 
slides were blocked with Duolink Blocking stock followed by the application of two 
PLA probes in 1X antibody diluent. The slides were washed in a wash buffer (1x 
TBS-T) for 5 min twice and processed for hybridization using Duolink Hybridi- 
zation stock 1:5 in high purity water and followed by incubation for 15 min at 37 °C. 
Duolink Ligation was performed with ligase and the slides were incubated in a pre- 
heated humidity chamber for 15 min at 37 °C. Amplification was then achieved 
using Duolink Amplification stock containing the polymerase and the slides were 
incubated again in a pre-heated humidity chamber for 90 min at 37 °C. DNA was 
stained with DAPI. A proximity ligation assay probe generates a fluorescent signal 
only when it binds to two primary antibodies (anti-HA and anti-Flag) attached to two 
proteins that are in maximum distance of 30-40 nm (ref. 12) to each other. One indi- 
vidual dot represents the close proximity of two interacting proteins within the cells. 
Confocal microscopy of T-B-cell conjugates. CD4* and CD8* T cells were iso- 
lated from human PBMCs by negative selection (EasySep, StemCell Technologies) 
and expanded by incubation with 1 1g ml ! anti-CD3 (clone OKT3; BioLegend), 
1 wg ml”! anti-CD28 monoclonal antibody (clone 28.8; BioLegend) with 50 U ml 
of recombinant IL-2 (National Institutes of Health) in complete RPMI media for 
5 days to upregulate CEACAM1 and TIM-3. After overnight rest, T cells were mixed 
in a 2:1 ratio with Cell Tracker Blue CMAC (Invitrogen)-labelled human EBV- 
transformed B cells pre-loaded with SEB (Sigma-Aldrich), and brought into contact 
by centrifugation at 200g for 5 min. Cells were incubated at 37 °C for 10 min and 
deposited onto poly-L-lysine coated coverslips for an additional 15 min. Cells were 
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fixed in 4% formaldehyde, permeabilized, blocked and stained with primary anti- 
bodies against CD3-¢ (Abcam), CEACAM1 (clone 5F4) and TIM-3 (R&D Systems) 
and appropriate fluorescently labelled secondary antibody. After immunofluores- 
cent labelling, coverslips were mounted on glass slides using ProLong Gold antifade 
reagent (Invitrogen) and cured in the dark at room temperature for 24 h. Imaging 
was performed on an Olympus [X81 confocal microscope. Z-stack images (0.15 1m) 
of conjugates were acquired with a X60 Plan-Apochromat oil objective, numerical 
aperture 1.42. 

Image processing and analysis. All fluorescence images were background subtracted, 
annotated and exported using Olympus Fluoview FV 100 image viewer. Colocali- 
zation analysis and Pearson’s correlation coefficient was calculated from corres- 
ponding interface regions from two channels using Volocity. For presentation in 
figures, images were adjusted and cropped equally across related groups. 
Mutagenesis of human CEACAM1 and TIM-3. Point mutations were introduced 
by PCR-based mutagenesis, using the QuikChange II Site-Directed Mutagenesis 
Kit (Agilent Technologies). The mutant oligonucleotides are listed in Extended Data 
Table 1a, b. Previously described vectors containing the human CEACAM1-3L 
variant™* and human TIM-3 in the pDisplay vector (Invitrogen)** were used as the 
template for all mutations. PCR reactions for single amino acid mutations were run 
for 16 cycles of 30s at 95 °C and 1 min at 55 °C, followed by 6 min at 68 °C. The 
resulting mutant plasmids were verified by Sanger DNA sequencing. TIM-3 amino 
acid residues were numbered according to National Center for Biotechnology Infor- 
mation database. 

Cell culture, transfection, tunicamycin treatment, immunoprecipitation and 
immunoblotting. HEK293T cells transfected with the 1,200 ng of Flag-tagged human 
CEACAM1I wild-type or mutant vectors or 1,200 ng of Flag-tagged ITGA5 (NM_ 
002205; Origene) and 1,200 ng of HA-tagged human TIM-3 wild-type or mutant 
vectors or 1,200 ng of vector controls when mono-transfections were performed 
and cells transfected for 48 h. In some experiments, 6h after transfection, trans- 
fected cells were treated with 2 1g ml * or 10 wg ml’ tunicamyin provided in DMSO 
for the last 24h of transfection. Transfected cells were washed once with cold PBS 
and lysed on ice with 0.5 ml of immunoprecipitation buffer containing 20 mM 
Tris-HCl, 0.15 M sodium chloride, pH 7.6, with protease inhibitor cocktail tablets 
(Roche) and 1.0% digitonin (Sigma). After 60 min, the cell lysates were spun at 
14,000 r.p.m. for 30 min at 4 °C. The lysate was subsequently incubated with 5 l 
of protein A-Sepharose (Sigma) that had been pre-adsorbed with an equal volume 
of immunoprecipitation buffer followed by three 1-h incubations with protein 
A-Sepharose. The pre-cleared lysates were incubated for 2 h at 4 °C with 10 pl of 
protein A-Sepharose beads bound to the specific antibodies. Some immunopreci- 
pitates were incubated with agarose-HA antibody. The control and specific immu- 
noprecipitates were washed with immunoprecipitation buffer and re-suspended in 
30 ul of Laemmli sample buffer without reducing agents. After boiling for 5 min, 
the proteins were resolved by SDS-PAGE in regular Tris-glycine buffer on a 4-20% 
Tris-Glycine Gel (Novex). The proteins were electrically transferred to a PVDF 
(polyvinylidene difluoride) membrane. After blocking with 5% skim milk in 0.05% 
PBS-Tween (PBS-T), the membranes were incubated for 12 h at 4 °C with primary 
antibodies. The membranes were further incubated with corresponding secondary 
antibodies for 1 h at room temperature and visualized by Amersham ECL Western 
Blotting Detection Reagents (GE Healthcare). The specific antibodies used for immu- 
noprecipitation and immunoblotting in this assay were anti-Flag antibody pro- 
duced in rabbit (Sigma), HA.11 (16B12) (Covance), anti- HA-Tag-agarose (MBL, 
Medical & Biological Laboratories Co), anti-HA antibody produced in rabbit and 
in mouse (Sigma) and anti-human CEACAM1 antibody, 5F4. Unsaturated films 
were digitally scanned and band intensities (densitometric analysis) were quan- 
tified using ImageJ (NIH). 

Co-immunoprecipitation of primary T cells. Primary human CD4" T cells were 
purified and activated with plate bound anti-CD3 (11g ml~') and anti-CD28 
(1 pg ml‘) in the presence of recombinant (r) IL-2 (10 ng ml‘; NIH) for 5 days. 
Cells were then rested in rIL-2 containing (10 ng ml” ') complete medium for 3 days 
and re-stimulated with soluble anti-CD3 (1 pg ml” ') for 2 days. Cell lysates were 
prepared as above and treated with N-glycanase by the manufacturer’s suggested 
protocol (Promega). N-glycanase treated lysates were immunoprecipitated with 
anti-human TIM-3 antibodies (2E2, 2E12 and 3F9) or mouse IgG1 as control 
and the immunoprecipitates subjected to SDS-PAGE and transferred to PVDF 
membranes followed by immunoblotting with the mouse anti-human CEACAM1 
monoclonal antibody, 5F4, as described above. Primary mouse splenocytes were 
prepared from Ceacam1-4L"® Ceacam1 '~ and Ceacam1-4S"* Ceacam1 '~ mice 
and cultured in the presence of anti-CD3 (1 1g ml!) or anti-CD3 (1 ug/ml!) 
plus anti-CD28 (1 jg ml” ') or medium only for 96 h. Cell lysates were prepared, 
immunoprecipitated with ccl (anti-mCEACAM1) and immunblotted with 5D12 
(anti-mTIM-3) monoclonal antibodies as described above. 

Pulse-chase experiments, immunoprecipitation, endoglycosidaseH digestion 
and SDS-PAGE. Cells were transfected as above with wild-type human HA-tagged 


hTIM-3 or Thr1011e variant of HA-tagged hTIM-3 with or without co-transfection 
with Flag-tagged human CEACAM1 vector. All transfections included a co-transfection 
with Flag-tagged influenza virus M2 protein (M2) encoding vector as a non-binding 
control. After 48 h, transfected HEK293T cells were incubated with methionine- 
and cysteine-free DMEM for 30 minat 37 °C. Cells were labelled with 10 mCi ml’ 
[°°S]-methionine/cysteine (1,175 Cimmol !; PerkinElmer Life Sciences) at 37 °C 
for the indicated times and chased with DMEM at 37 °C for the indicated times. 
Cells were lysed in Nonidet P-40 lysis buffer (50 mM Tris, pH 7.4, 0.5% Nonidet 
P-40, 5mM MgCl, and 150mM NaCl). Immunoprecipitations were performed 
using 30 pl of anti-HA agarose (Roche) for 2h at 4°C with gentle agitation. For 
enzymatic digestions, immunoprecipitates were denatured in glycoprotein dena- 
turing buffer (0.5% SDS, 1% 2-mercaptoethanol) at 95 °C for 5 min, followed by 
addition of sodium citrate (pH 5.5) to a final concentration of 50 mM, and incubated 
with endoglycosidaseH (New England Biolabs) at 37 °C for 2h. Immune com- 
plexes were eluted by boiling in reducing sample buffer, subjected to SDS-PAGE 
(10%), and visualized by autoradiography. Densitometric quantification of radio- 
activity was performed on a PhosphorImager (Fujifilm BAS-2500) using Image 
Reader BAS-2500 V1.8 software (Fujifilm) and Multi Gauge V2.2 (Fujifilm) soft- 
ware for analysis. Quantification of human TIM-3 in pulse-chase metabolic label- 
ling was calculated by the following formula representing the average of 2-4 data 
sets: (human TIM-3 signal intensity — M2 protein signal intensity/M2 protein 
signal intensity). 

Transduction of primary human T cells with shRNA lentiviral vectors to silence 
human TIM-3. Lentiviral vectors encoding short hairpin RNAs (shRNAs) (pLKO.1) 
targeting human TIM3 anda control shRNA targeting lacZ were obtained from the 
Dana Farber DNA Resource Core. Lentiviral particles were produced as previously 
described**. Primary human CD4* T cells were transduced 3 days after stimula- 
tion and selected with 2 1g ml’ puromycin (Sigma) for 5 days followed by gradual 
increases of puromycin (5 jig ml’) for the next 5 days and re-stimulated with anti- 
CD3 and CD28 for an additional 5 days. TIM3 shRNA 1 (TRCN0000158033, 5’-C 
GTGGACCAAACTGAAGCTAT-3’); TIM3 shRNA 2 (TRCN0000154618, 5'-G 
CACTGAACTTAAACAGGCAT-3’); TIM3 shRNA 3 (TRCN0000157816, 5’-C 
AAATGCAGTAGCAGAGGGAA-3’); lacZ control knockdown (TRCN0000072225; 
5'-CTCTGGCTAACGGTACGCGTA-3’). 

Transduction of primary mouse T cells with retroviral vectors. For transduction 
of mouse primary CD4* T cells, previously described mouse TIM-3 and its mutant 
expressing retroviruses were prepared in HEK293T cells using eco and gag/pol viral 
envelope constructs (Clontech Inc.)!”*”**. Viral supernatants were collected 48 h 
after transfection and centrifuged and filtered with a 0.45-1m filters. CD4” T cells 
were prepared at 10° cells per ml in the presence of polybrene (Sigma; 8 mg ml‘). 
Viral particles and CD4* T cells were co-incubated by spin inoculation at 2,000¢ 
for 60 min on two sequential days. To detect ectopic expression of the constructs in 
T cells, cells were allowed to rest for 3 days in 2ngml ' IL-2 and sorted to obtain 
cells with the highest (45%) GFP expression. Sorted cells were titrated into in vitro 
T-cell assays. 

Galectin-9 induction of apoptosis. Naive CD4* T cells were polarized under Ty1 
inducing conditions. In brief, naive CD4* T cells (2 X 10° per ml) were stimulated 
with plate-bound anti-CD3 specific antibodies (10 pg ml” ') for 48 h. Soluble anti- 
CD28 antibodies (1 1g ml), recombinant (r) IL-2 (10 ng ml !, NIH) and block- 
ing antibodies to IL-4 (10 pg ml!) and rIL-12 (5 ng ml ') were added and the 
cells cultured for 7 days. Cells were washed and treated with galectin-9 (2 ug ml '; 
eBioscience) for 8 h to induce apoptosis. Apoptosis was detected by flow cytometry 
using apoptosis staining kit (Roche) that stains for annexin V and propidium idodide. 
Protein purification, crystallization, X-ray data collection and model determi- 
nation. Competent E. coli BL21 DE3 cells were transformed with a pET9a vector 
carrying a gene insert coding for a single-chain ofa protein consisting of the human 
CEACAM1 IgV domain (residues 1-107), a 20 amino acid linker (4 repeats of a 
Gly-Gly-Gly-Gly-Ser motif), and the human IgV domain of TIM-3 (residues 1-105) 
with a hexa-histidine tag appended to the C terminus. To express the protein, trans- 
formants were grown in 1 litre of LB broth under antibiotic selection and induced 
with 0.1 mM isopropyl-f-p-thiogalactoside (IPTG) after reaching an absorbance 
at 600 nm (A¢00 nm) of approximately 0.8. Cells were grown for an additional 16 hat 
18°C and collected by centrifugation. Protein was purified from inclusion bodies, 
which were solubilized in a chaotropic buffer and refolded in a buffer containing 
200 mM Tris, pH 8.5, 0.4M arginine-HCl, 2mM EDTA, 5 mM cysteamine and 
0.5 mM cystamine, as reported previously”. Refolded protein was loaded onto a 
Ni-NTA affinity column and washed several times with a buffer containing 50 mM 
HEPES, pH 7.5, 300 mM NaCl, 2.5 mM CaCl, 10% glycerol and 10-30 mM imi- 
dazole. The protein was eluted with 50 mM HEPES buffer, pH 7.2, 300 mM NaCl, 
2.5 mM CaCly, 10% glycerol and 300 mM imidazole. Eluted protein fractions were 
analysed by SDS-PAGE, pooled, concentrated and further purified with a Superdex 
75 gel filtration column (GE Healthcare) in a buffer containing 50 mM HEPES 
buffer, pH 7.5, 200 mM NaCl, 2.5 mM CaCl, and 10% glycerol. Circular dichroism 
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was conducted to confirm proper protein folding. Purified protein (purity >95%, 
estimated by SDS-PAGE) was concentrated to 2.5 mg ml ' for crystallization. 

The initial crystal growth trials were performed by vapour diffusion using a Phoe- 
nix robotic system (Art Robbins) with Index Screens 1 and 2 (Hampton Research) 
and optimized using an additive screen (Hampton Research). Optimized crystals 
were formed in 4 il drops (2 il protein, 2 pl mother liquor), equilibrated against a 
mother liquor containing 100 mM Tris 8.0 and 25% PEG 3350, at room tempera- 
ture in a sitting drop setup. For data collection, crystals were cryoprotected in solu- 
tion containing 25% PEG 3350 and 12% glycerol. X-ray data were collected using 
the X25 beamline at the National Synchrotron Light Source (NSLS). The data were 
processed with Mosflm*° and the CCP4 software suite*!. 

A crystallographic model of the single-chain human CEACAM1-TIM-3 protein 

was built by a molecular replacement strategy using a search model made by modi- 
fying published structures of the IgV (N) domain of human CEACAM1 (Protein 
Data Bank (PDB) code 2GK2) and mouse TIM-3 (PDB code 2OYP). Specifically, 
all residues were changed to alanine to reduce model bias and all small molecule 
ligands, water and metals were removed. Molecular replacement was performed 
with the program Molrep. Model building and structural refinement were performed 
with the PHENIX software’ and COOT”, respectively. Five per cent of the total 
reflection data were excluded from the refinement cycles and used to calculate the 
free R factor (Reree) for monitoring refinement progress. Repeated rounds of model 
building and refinement with group B-factors refinement strategy and Torsion- 
NCS restraints resulted in a final crystallographic Rwork/ Riree of 34.5%/37.3% using 
all data to 3.4A resolution. The X-ray data and refinement statistics are shown in 
Extended Data Table 2. Most of the residues were well defined and verified by PDB 
validation report (data not shown). However, the B-strands associated with the hTIM-3 
AB-ED (ref. 13) face did not show the secondary structure of B-sheet formation, 
and two disulphide bonds (Cys 32-Cys 110 and Cys 53-Cys 62) observed in the 
mouse TIM-3 structure were missing from the resolved structure. A 2F, — F, elec- 
tron density map of the human CEACAM1 (IgV)-TIM-3 (IgV) single chain was 
calculated using PHENIX. All the figures were drawn using PyMOL (The PyMOL 
Molecular Graphics System, Schrodinger, LLC) and labels were added using Adobe 
Photoshop. 
Protein purification, crystallization, data collection and structure determina- 
tion of CEACAMI (IgV). Competent E. coli BL21 DE3 cells were transformed 
with a pET9a vector carrying the human CEACAM1 IgV domain gene insert with 
a N-terminal GST tag. To express the protein, transformants were grown in 1 litre 
of LB broth under antibiotic selection and induced with 0.1 mM IPTG after reach- 
ing an Agoo nm Of approximately 0.8. Cells were grown for an additional 16h at 
18 °C and collected by centrifugation. Cell pellets were suspended in 20 mM Tris-HCl 
buffer, pH 7.5, containing 150 mM NaCland 10% glycerol, and lysed by sonication. 
After centrifugation, supernatant was loaded onto a GST column (GE Healthcare) 
and washed several times with 50 mM Tris-HCl buffer, pH 7.5, containing 150 mM 
NaCl, 2.5 mM CaCl, 0.1% Triton X-100, and 10% glycerol. After on-column removal 
of the GST affinity tag with thrombin, protein was eluted with 25 mM Tris-HCl 
buffer, pH 7.5, containing 200 mM NaCl, 2.5 mM CaCl, and 10% glycerol. Eluted 
protein fractions, as judged by SDS-PAGE, were concentrated and purified through 
a Superdex 75 gel filtration column (GE Healthcare) by FPLC system with a buffer 
containing 50 mM Tris-HCl buffer, pH 7.5, containing 200 mM NaCl, 2.5mM 
CaCl, and 10% glycerol. A single peak for human CEACAM1 IgV domain was col- 
lected and the protein was concentrated to 5 mg ml ' for crystallization. A purity 
of >95%, was verified by SDS-PAGE. 

The initial crystal growth trials were performed in a 96-well format using a Phoe- 
nix robotic system (Art Robbins) with Index Screens 1 and 2 (Hampton Research) 
at room temperature. Optimized crystals were formed in well solution containing 
60% Tascimate, pH 8, and 1% B-octyl glucoside at room temperature. For data col- 
lection, crystals were cryoprotected in solution containing 60% Tascimate, pH 8 
and 18% glycerol. X-ray data were collected using beamline X6A at the National 
Synchrotron Light Source (NSLS). The data were processed with Mosflm” and the 
CCP4 software suite*’. The structure of the human CEACAMI N (IgV) domain was 
determined by the molecular replacement method using a modified N-terminal 
domain of human CEACAM 1 asa search model (PDB code 2GK2) as the starting 
model, with all residues changed to alanine to reduce model bias and all small mole- 
cule ligands, water and metals were removed. Molecular replacement was performed 
with Molrep, refinements were performed with PHENIX” and intermittent model 
building was performed with COOT”, as described above. Subsequent refinements 
cycles and model building with individual B-factors and Torsion-NCS restraints 
strategy led to the final crystallographic Rwork/Réree of 20.3%/24.2% using all data to 
2.0 A resolution. Electron densities were also identified for bound molecules of 
beta-octyl glucoside and malonic acid. The X-ray data and structure refinement 
statistics are shown in Extended Data Table 2. All residues were well defined and 
verified by PDB validation report (data not shown). A 2F, — F, electron density 
map was calculated using PHENIX”. 
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Structural modelling of CEACAM1-TIM-3 interactions. To model potential 
interactions between CEACAMI and TIM-3, superimposition data as shown in 
Extended Data Fig. 2d, e, of previously described immunoglobulin-variable (IgV) 
domains of mouse CEACAM1"* (PDB code 1L62) and mouse TIM-3 (PDB 20YP)'* 
was generated in Pymol (The PYMOL Molecular Graphics System, Schrodinger, 
LLC), which revealed structural similarity as shown by a score of 2.42 by root mean 
square deviation (r.m.s.d.) analysis as calculated by Pymol". In addition, multiple 
sequence alignments were performed using ClustalW”. From this analysis, we con- 
sidered mouse CEACAM1] and TIM-3 as structurally similar proteins. Furthermore, 
as previous structural data for mouse CEACAM1 supported the formation of 
CEACAM1 homodimers between its N-terminal domains”, and given our evidence 
that mouse CEACAM1 and TIM-3 are highly similar, we further reasoned that 
mouse TIM-3 and its human orthologue might heterodimerize with CEACAM1 
through their membrane-distal N-terminal domains. This hypothesis was mod- 
elled by us in the following manner. We aligned CEACAM1 symmetrically for cis 
modelling and asymmetrically for trans modelling along the five amino acid resi- 
dues within the FG-CC’ loops of mTIM-3 as reported previously’* that describe 
putative docking sites for an unknown ligand for mouse TIM-3. The ROSIE Dock- 
ing Server was used to analyse the docking (http://rosettadock.graylab.jhu.edu/ 
documentation/docking)***’. We identified the ten best scoring structures from 
the run in rank order by energy. The top two in cis and the best in trans are shown 
in Extended Data Fig. 2g. The score (y axis) indicates the energy of the models (the 
lower the better). The r.m.s. (x axis) indicates the RMS values between the input 
and output docking models. The docking models identified were further assessed 
in PyMOL to determine whether the models identified predicted hydrogen bonding 
within the putative mouse CEACAM1-TIM-3 interface. On the basis of this we iden- 
tified potential amino acids that may have an important role in mouse CEACAM1- 
TIM-3 binding, which is summarized in the table associated with Extended Data 
Fig. 2g. 

We used the amino acids of mouse CEACAM1 and mouse TIM-3 modelled at 
the interface between these two proteins to predict potential contact sites for the 
human orthologues of CEACAM1 and TIM-3. Furthermore, we searched human 
exomic databases for allelic variants in these predicted human amino acids. Finally, 
we used all of this information to make structural modifications of human CEACAM1 
and TIM-3 by site-directed mutagenesis to biochemically examine our hypothesis 
and confirm this by X-ray crystallographic analysis of a human CEACAM1-TIM-3 
single-chain protein. 

IBD case-control samples and genotyping. German Crohn’s disease and ulcera- 
tive colitis patients were recruited at the Department of General Internal Medicine 
of the Christian- Albrechts- University Kiel and Charité Universitatsmedizin Berlin, 
or nationwide with the support of the German Crohn and Colitis Foundation and 
the Bundesministerium fiir Bildung und Forschung (BMBF) competence network 
‘IBD’ (Extended Data Table 3a). The patients were classified according to clinical, 
radiological, histological and endoscopic (that is, type and distribution of lesions) 
according to accepted criteria**“’. Healthy control individuals were obtained from 
the popgen biobank”’. All controls were drawn from a population-representative 
sample. Given the low prevalence of both Crohn’s disease and ulcerative colitis and 
the fact that all control individuals self-reported to have neither Crohn’s disease 
nor ulcerative colitis, control individuals were designated ‘healthy’. Written, informed 
consent was obtained from all study participants and all protocols were approved by 
the institutional ethical review committees of the participating centres. The variants 
described in Extended Data Table 3b were genotyped using the Sequenom iPlex 
and the Life Technologies Taqman system. rs147827860, the single nucleotide poly- 
morphism encoding the hypomorphic Thr101le allele of TIM3, is a rare variant 
with a minor allele frequency of 0.004 in individuals of European descent predict- 
ing homozygous carriage with a frequency of 0.000016 (or 1:62,500). None of the 
publicly accessible databases lists any homozygous occurrences of this variant. Asso- 
ciation in the case-control data was tested with Haploview 4.2 (ref. 51). An exact 
test for Hardy-Weinberg equilibrium was performed using the DeFinetti program 
(Strom, T. M. & Wienker, T. F., http://ihg.gsf.de/cgi-bin/hw/hwal1.pl). 
Purification of IgV domain of human TIM-3. The TIM-3 IgV domain was expressed 
with a C-terminal GST tag as inclusion bodies from pET9a in E. coli BL21 DE3 cells. 
Inclusion bodies were refolded as described previously in refolding buffer contain- 
ing 200 mM Tris, pH 8.5, 0.4M arginine-HCl, 2mM EDTA, 5 mM cysteamine and 
0.5 mM cystamine”’. Refolded proteins were purified by affinity and gel filtration 
chromatography. The proteins were maintained in a solution containing 25 mM 
HEPES buffer, pH 7.2, containing 200 mM NaCl, 2.5 mM CaCl, and 5% glycerol. 
Circular dichroism was conducted to ascertain proper protein folding. A purity of 
>95%, was verified by SDS-PAGE. 

GST protein pull-down assays. Glutathione-S-transferase (GST) pull-down analyses” 
of the interaction between IgV domains of human TIM-3 and human CEACAM1 
were performed in the following manner. The GST-hTIM-3 protein was stored in 
refolding buffer, and buffer-exchanged with PBS before performing the assay. The 
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IgV domain of GST-hTIM-3 proteins were coupled to the glutathione-agarose 
column following the manufacturer’s instructions (Thermo scientific) and in-column- 
incubated with HEK293T cell lysates derived from cells transfected with Flag-tagged 
human CEACAM1. The columns were washed extensively and bound proteins were 
eluted by glutathione. Eluted protein complexes were immunoblotted for the detection 
of CEACAM1 (immunoblot with anti-Flag) and TIM-3 (immunoblot with anti-GST). 
TIM-3 ligand precipitation assays. Extracellular membrane-associated proteins 
on live TK-1 cells were labelled with biotin (EZ-Link Sulfo-NHS-LC-Biotin; Pierce). 
Whole-cell lysates were prepared and were incubated with 5 pg human [gG, full 
length TIM-3-Ig or soluble (s) TIM-3-Ig in the presence of protein G-agarose beads 
(Roche). Beads were washed and boiled with 1X SDS-PAGE loading buffer. Super- 
natants were collected by centrifugation, and half of each was digested with PNGase 
F (New England Biolabs). Samples were separated by SDS-PAGE and the TIM-3-Ig 
fusion protein-specific binding proteins were detected by immunoblot or sliver staining. 
Mouse model of IBD®. Splenic mononuclear cells were obtained from female wild- 
type, Ceacam1 ~~ Ceacam-4L"8 Ceacam1'~, Tim3"® Ceacam1! — Tim3"8 Ceacam1*!* 
(Tim3"®/wild-type) mice and CD4~ T cells were isolated using anti-CD4 (L3T4) 
MACS magnetic beads (Miltenyi Biotec) according to the manufacturer’s instruc- 
tions. Enriched CD4* T cells (94-97% pure as estimated by FACS) were then labelled 
with PerCP-conjugated anti-CD4, allophycocyanin-conjugated anti-CD44, and 
FITC-conjugated anti-CD62L and phycoerythrin-conjugated anti-TIM-3 (5D12 
or 2C12). Subpopulations of CD4* T cells were identified by three-colour sorting 
ona FACSAria (Becton Dickinson). All populations were defined as naive and 98.0% 
pure on re-analysis. To induce chronic colitis in animals, 5 X 10° CD4* cp44"° 
CD62L"®" (naive) T cells were adoptively transferred intraperitoneally into 6-10- 
week-old Ceacam1 ‘~ Rag2’~ recipient mice. Weights were measured every week, 
and mice were euthanized by CO; for histological evaluation of colitis and lamina 
propria mononuclear cells. Owing to the severity of colitis associated with adoptive 
transfer of naive Ceacam1 ’ CD4* T cells, significant mortality was observed result- 
ing in unequal group size at the at the time of culling necessitating post-hoc cor- 
rection in the statistical analysis (see ‘statistical methods’). 

Colorectal carcinogenesis models. To examine the effect of CEACAM1 and TIM-3 
on tumour incidence and multiplicity in an inflammation induced colorectal cancer 
model, mice were intraperitoneally injected at 6 weeks of age with the carcinogen 
AOM (Sigma-Aldrich) at 10 mg kg body weight as previously reported***°. One 
week later, the mice were started on the first of two 21-day DSS cycles, consisting of 
a period of 7 days with the tumour promoter DSS at 2.5% or 1.5% in the drinking 
water followed by 7 days of receiving regular water. Mice showing signs of morbidity 
were culled. Colons were removed and flushed with PBS buffer and cut longitudi- 
nally. Colon tissue sections were either paraffin-embedded for immunohistochem- 
istry or analysis of lamina propria mononuclear cells associated with the tumours. 
CT26 colorectal carcinoma cells were diluted into 10 X 10° cells per ml of PBS. After 
shaving, either wild-type or Ceacam1 ~'~ mice received 200 ul of the CT26 cell 
suspension subcutaneously in the left flank. All antibody treatments were applied 
intraperitoneally into the right flank according to the schedules described in Extended 
Data. Two-hundred micrograms of each antibody or its isotype control were admi- 
nistered according to the schedules described (CEACAM1I, ccl clone; TIM-3 5D12 
clone; PDL1, 10F.9G2) Tumour growth was assessed three times per week at 7 days 
after tumour inoculation by measuring the width and length of the tumours (mm) 
using calipers and the areas defined (mm’; width x length). Mice were euthanized 
on the basis of the extent of tumour growth observed in the isotype antibody treated 
control animals to minimize animal suffering in accordance with the approved 
animal protocol. 

Histopathological examination of colitis and colitis-associated colorectal can- 
cer. Colons were removed from mice after termination and dissected free from the 
anus to the caecum. Colonic contents were removed and colons cleaned with PBS 
before fixation in 4% paraformaldyhyde or 10% neutral buffered formalin followed 
by routine paraffin embedding. After paraffin embedding, 0.5-mm sections were 
cut and stained with haematoxylin and eosin. Sections were examined and colitis 
was scored in a blinded fashion (with respect to genotype and experimental pro- 
tocol) by one of the authors (J.N.G.). Each of four histological parameters for the 
severity of colitis was scored as absent (0), mild to severe (1-6): mononuclear cell 
infiltration, polymorphonuclear cell infiltration, epithelial hyperplasia, epithelial 
injury and extent of inflammation, modified froma previous study**. For AOM/DSS, 
colitis-associated colorectal cancer studies, an expert pathologist (J.N.G.) examined 
all tissues for dysplasia, adenoma, low-to-high-grade adenoma and adenocarcinoma 
in a blinded fashion. 

RNA, complementary DNA and quantitative PCR. RNA was isolated from whole 
cells using the Qiagen microRNA extraction kit following the manufacturer’s instruc- 
tions. RNA was quantified spectrophotometrically, and complementary DNA was 
reverse-transcribed using the cDNA archival kit (Applied Biosystems) following 
the manufacturer’s guidelines. All primers were obtained from Applied Biosystem. 
The cDNA samples were subjected to 40 cycles of amplification in an ABI Prism 


7900 Sequence Detection System instrument according to the manufacturer’s pro- 
tocol. Quantification of relative messenger RNA expression was determined by the 
comparative C, (critical threshold) method as described whereby the amount of 
target mRNA, normalized to endogenous cycles of amplification was determined 
by the formula 2744“, 

Gene expression analyses by NanoString. Naive T cells from wild-type, Tim3"® 
and Tim3"® Ceacam1 ‘~ mice were transferred into Ceacam1 '~ Rag2’'~ recipi- 
ents to induce colitis. For the analyses of transcriptional regulation, colonic infilt- 
rating lamina propria mononuclear cells were isolated at 6 weeks after T-cell transfer 
and immediately lysed in RLT buffer (Qiagen). For RNA analysis, 100 ng of total 
RNA was used and hybridized to a custom designed gene CodeSet (non-enzymatic 
RNA profiling using bar-coded fluorescent probes) according to the manufacturers 
protocol (Nanostring Technologies). Barcodes were counted (1,150 fields of view 
per sample) on an nCounter Digital Analyzer following the manufacturer’s protocol 
(NanoString Technologies, Inc.). 

Cytokine measurements. To measure cytokine production by ELISA, 1 X 10° CD4* 
T cells were cultured in 200 pl RPMI 1640 (Sigma-Aldrich) supplemented with 
10% heat-inactivated FBS, 500 U ml“ penicillin, 100 1g ml’ streptomycin (Sigma- 
Aldrich), 10 mM HEPES, 1% nonessential amino acids and 50 j.M 2-mercaptoethanol 
(Life Technologies Invitrogen), which was termed complete RPMI 1640, in the pres- 
ence of 5 pg ml * plate-bound anti-CD3 and 2 1g ml * soluble anti-CD28 mono- 
clonal antibodies on flat-bottom 96-well plates (Costar), at 37 °C in a humidified 
atmosphere incubator containing 5% CO, for 48 h or otherwise indicated. Culture 
supernatants were removed and analysed for the production of cytokines such as 
IFN-y, IL-2 or TNF-a. Cytokine concentrations were determined using specific 
ELISAs (R&D Systems) according to the manufacturer’s recommendations. To mea- 
sure cytokine production by intracellular cytokine staining on colitis-associated CD4* 
T cells obtained from the dissection of mesenteric lymph nodes, lymph nodes or 
lamina propria were incubated at 37 °C for 4-6 h in complete RPMI 1640 medium 
with 50 ng ml * PMA (Sigma-Aldrich), 500 ng ml ' ionomycin (Sigma-Aldrich), 
and 1 pl ml~’ GolgiPlug (BD). Surface staining was performed for 30 min, after 
which the cells were resuspended in fixation/permeabilization solution (Cytofix/ 
Cytoperm kit; BD). Intracellular cytokine staining using antibodies as described 
earlier was performed according to the manufacturer’s instructions. To measure 
cytokine production by intracellular cytokine staining in draining lymph node CD4* 
or CD8* T cells derived from CT26-tumour bearing mice, cells were stimulated with 
5 pg ml’ AH1-peptide (SPSYVYHQEF) or tumour-infiltrating lymphocytes (TIL) 
with 1 pg ml! plate-bound monoclonal antibody (mouse anti-CD3 antibody for 
48 h). The cells were counterstained with monoclonal antibody against CD8-FITC 
with or without AH1-tetramer (Medical Biological Laboratories) which detects the 
MuLV gp70 peptide (SPSYVYHQF) in the context of H-2L" and analysed by flow 
cytometry. 

Surface plasmon resonance. Studies were performed on Biacore 3000 (GE Health- 
care). GST-hTIM-3 protein, or GST alone as control, was immobilized on a CM5 
sensor chip using amine coupling chemistry as per the manufacturer’s instructions. 
The coupling was performed by injecting 30 1g ml of protein into 10 mM sodium 
acetate, pH 5.0 (GE Healthcare). HBS-EP buffer (0.01 M HEPES, 0.15 M NaCl, 
3mM EDTA, 0.005% surfactant P20; GE Healthcare) was used as running buffer 
and dilution buffer. Serial dilutions of analytes in HBS-EP were injected at 25 °C 
with a 25 yl min’ flow rate and data collected over time. The surface was regener- 
ated between different dilutions with 10 mM glycine-HCl, pH 2.5 (GE Healthcare). 
For blockade of single-chain binding to GST-hTIM-3, single-chain hCEACAM1- 
hTIM-3 protein was injected alone or together with antibody (anti-human CEACAM1 
monoclonal antibody, 26H7) or TIM-3-specific peptide (residues 58-77, 58-CPV 
FECGNVVLRTDERDVNY-77) with IgG1 mouse antibody or scrambled peptide 
(TLCVCFVNPYDVRVNDEREG) used as controls, respectively. All data were zero 
adjusted, and the reference cell value was subtracted. 

Isolation of lamina propia mononuclear cells and TILs. Colonic lamina propria 
mononuclear cells, colonic polyp-derived TILs and subcutaneous CT26 derived 
TILs were isolated as follows. Total colons from each group were longitudinally cut 
and washed with HBSS (free of Ca** and Mg”*) to remove faeces and debris. Sub- 
cutaneous CT26 tumours were dissected and dissociated in a gentle MACS dis- 
sociator (Miltenyi Biotec). Tumours, tumour-free colon pieces and CT26 tumour 
pieces were further finely minced and incubated in HBSS containing 5 mM EDTA, 
0.145 mg ml”! dithiothreitol, 1 M HEPES, 10% EBS, and 1% penicillin/streptomycin 
at 37 °C for 15 min for two cycles to dissociate the epithelial monolayers. EDTA was 
then removed by three washes in HBSS. The colon tissue and tumour specimens 
then were digested in RPMI 1640 containing 0.4 mg ml * collagenase D (Roche) 
and 0.01 mg ml ~ 1 DNase I (Roche) for 20 min (tumour-free colon pieces) or 30 min 
(tumour pieces) at 37 °C ona shaking platform and further digested with collagenase 
IV (Roche) at 0.01 mg ml! DNase I (Roche) for 20 min (tumour-free colon pieces) 
or 30 min (tumour pieces) at 37 °C on a shaking platform. After enzymatic treat- 
ment, digested tissue was passed through a 70-tm cell strainer and the flow-through 
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medium containing the mononuclear cells was collected and centrifuged at 400g 
for 10 min and pelleted and subjected to a 40:80% Percoll (GE Healthcare) gradient 
followed by centrifugation for 20 min at 2,500 r.p.m. at room temperature. Lamina 
propria mononuclear cells or TILs were collected at the interphase of the gradient, 
washed once, and resuspended in RPMI 1640 complete medium for further analyses. 
Statistical methods. Standard two-tailed t-test was applied throughout except for 
the following exceptions. A Welch t-test was applied in the setting of unequal vari- 
ance. The Mann-Whitney U test was applied when data were demonstrated to not 
follow a Gaussian distribution. In experiments where more than two groups were 
compared, one-way analysis of variance (ANOVA) was performed followed by 
application of post-hoc correction using Bonferroni’s multiple comparison test or 
Dunnett’s correction followed by Friedman test. Kruskal-Wallis with post-hoc Dunn’s 
correction was used in making multiple comparisons of groups with unequal size. 
Comparisons of mortality were made by Kaplan-Meier survival curve analysis with 
log-rank test to assess differences in cancer survival. Pearson’s coefficient for cor- 
relation (r) test was performed for comparison of variables. GraphPad Prism ver- 
sion 5.0b was used for calculations. P values of 0.05 were considered significant. 
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Extended Data Figure 1 | CEACAM1 is essential for TIM-3 mediated T cell 
tolerance. a, Schematic diagram of OVA antigen-specific tolerance induction 
model. b, Schematic diagram of OVA immunization. c, Tracking in vivo 
antigen-specific T-cell responses of CFSE-labelled OT-II transgenic Rag2‘~ 
T cells in total lymphocyte gate of mesenteric lymph nodes, peripheral lymph 
node or spleen of wild-type or Ceacam1~'~ recipients after gating on 
CFSE-positive cells and staining for CEACAM1 in PBS and OVA323_ 339 
immunized mice. Hyper-responsiveness of OT-II transgenic Rag2 ‘~ T cells in 
Ceacam1~'~ mice was not due to decreased regulatory T-cell induction (data 
not shown) or increased initial parking on the basis of cell numbers shown. 
d, TIM-3 expression on CEACAM1-positive and -negative CFSE" cells as in 
c.e, Schematic diagram of SEB-induced T-cell tolerance model. f, mCEACAM1 
and mTIM-3 expression on CD4* VB8°" T cells after SEB tolerance induction. 
g, hCEACAM1 and hTIM-3 expression on activated primary human T cells 
defined by staining with indicated antibodies. h, CEACAM1 expression on 
TIM-3-silenced primary human T cells after re-activation by flow cytometry. 
Relative TIM-3, CEACAM1 or CD4 expression on T cells expressing control 
shRNA (lacZ control, red) or three independent shRNAs directed at TIM3 
(overlay, blue). shRNA target sequences shown. i-], CEACAM1 and TIM-3 
expression and functional consequences on T cells in HIV infection. CD4* 
IFN-y* T cells are decreased among CEACAM1* TIM-3* CD4* T cells in 
HIV infection in response to Gag peptides (i). Although proportions of 
CEACAMI™* TIM-3* CD8" T cells are similar in HIV-infected and 
-uninfected subjects (j), CEACAM1* TIM-3* CD8* T cells express little 


LETTER 


IFN-y after stimulation with HIV Gag peptides or SEB relative to TIM-3* 
CEACAMI1 CD8" T cells (k, 1). C, hCEACAMI]; T, hTIM-3 (n = 4 per 
group, mean + s.e.m.). m-o, In situ proximity ligation analysis (PLA) of 
CEACAM1 and TIM-3. m, HEK293T cells transiently co-transfected with 
Flag-hCEACAM1 or HA-hTIM-3. Cells stained with DAPI (left), anti-tubulin 
(middle), anti-HA (rabbit) and anti-Flag (mouse) (middle right) or merged 
(right). Several examples of a positive PLA signal (middle right and right panels: 
red fluorescent dots) indicative of a maximum distance of 30-40 nm 

between hCEACAM1 and hTIM-3. n, Negative control, co-expression of 
Flag—PLK1 (protein kinase I) and HA-TIM-3 failed to generate fluorescent 
dots (that is, PLA negative). Cells stained with DAPI, anti-tubulin, anti-HA/ 
anti-Flag or merged as in m. 0, Negative control, co-expression of HA~ADAP 
(adhesion and degranulation promoting adaptor protein) failed to show a 
signal (that is, PLA negative) with staining as in m. p, gq CEACAM1 and TIM-3 
colocalization at immunological synapse of primary human CD4 and CD8 

T cells. Confocal microscopy of hTIM-3* hCEACAM1™ primary CD4* and 
CD8* T cells forming conjugates with SEB-loaded B cells. DIC, differential 
interference contrast. Blue denotes B cell; red denotes CD3; purple denotes 
CEACAM1; green denotes TIM-3. White indicates colocalization between 
CEACAM1 and TIM-3 (p). Average Pearson correlation coefficients for cD4* 
and CD8* T cells were 0.543 and 0.566, respectively, representing strong 
co-localization (q). Data are mean = s.e.m. and representative of five (f, g), 
four (p, q), three (c, d, m-o) and two (h) independent experiments. *P < 0.05; 
**P < 0.01; ***P < 0.001. 
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Extended Data Figure 2 | Structural similarities between CEACAM1 and Pymol. e, Sequence alignment of the IgV-like domains of mCEACAMI and 


TIM-3 IgV-like N-terminal domains and biochemical association. 

a-c, Interaction between TIM-3-Ig fusion protein and membrane protein of 
60 kDa after deglycosylation derived from surface-biotinylated TK-1 cells. 
TIM-3-Ig fusion proteins and human IgG-precipitated proteins were 
deglycosylated by PNGase F and separated by SDS-PAGE. TIM-3-Ig-binding 
membrane proteins detected by immunoblot. A 60-kDa membrane protein 
(red circles) and 32-kDa protein consistent with galectin-9 (black circles) are 
found specifically associated with soluble (s) TIM-3-Ig fusion protein (a, lane 
5) and full-length (f) TIM-3-Ig proteins (b, lane 5), but not with the pre-clear 
controls (lanes 3 and 4) or human IgG (lanes 2 and 6). c, sTIM-3-Ig and 
full-length (fl) TIM-3-Ig interacting proteins were de-glycosylated by PNGase 
F and separated by SDS-PAGE. Proteins detected by silver staining. A band 
of 60 kDa (red circle) is specifically associated with sTIM-3-Ig proteins (lane 2), 
but not with human IgG (lane 5), or TIM-1-Ig or TIM-4-Ig (lanes 3, 4, 6-10). 
d, Superimposition of previously described IgV-like domains of mCEACAM1 
and mTIM-3 demonstrate structural similarity with a score of 2.42 by the 
structural alignment and root mean square deviation (r.m.s.d.) calculated by 


mTIM-3 on the basis of the secondary structure alignment in d. f, Sequence 
alignments of IgV domain sequences of CEACAM1 and overall mTIM and 
hTIM family members. « helices (orange) and £ strands (blue) denoted as 
underlined segments in hCEACAM1 and mTIM-3. B strands labelled with 
upper- and lower-case letters for hCEACAM1 and mTIM-3, respectively. 
Conserved residues are shaded red. Mutated residues are shaded violet for 
hCEACAM1, and green for hTIM-3. Asterisk (*) indicates positions having a 
single, fully conserved residue; a dagger (+) indicates conservation between 
groups of weakly similar residues; a double-dagger ({) indicates conservation 
between groups of strongly similar residues. g, Computational modelling as 
defined by energy calculations (score) relative to r.m.s. values of docking 
models to define potential cis and trans interfaces between mCEACAM1 and 
mTIM-3 as described in Supplementary Information and amino acids 
involved. h, CEACAM1 expression on mouse fibroblast 3T3 cells used to 
identify a galectin-9-independent ligand. i, CEACAM1 expression on mouse 
TK-1 cells as in a-c. Representative of three (a-c, h, i) independent 
experiments. 
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Extended Data Figure 3 | Biochemical characterization of interactions 
between CEACAM1 and TIM-3. a, hTIM-3 does not co-immunoprecipitate 
(co-IP) with ITGAS5 despite interactions with hCEACAM1. HEK293T cells 
transfected with Flag-ITGA5 and HA-TIM-3 (ITGA5Tw) or Flag-CEACAM1 
and HA-TIM-3 (CwIw). Immunoprecipitation with anti-HA antibody and 
immunoblotted (IB) with anti-Flag antibody are shown. Input represents 
anti-Flag immunoblot of lysates. b, Co-immunoprecipitation of human TIM-3 
and CEACAMI from activated primary human T cells after N-glycanase 
treatment of lystates followed by immunoprecipitation with anti-human 
TIM-3 antibodies (2E2, 2E12 or 3F9) or IgG as control and immunoblotted 
with anti-human CEACAM1 antibody (5F4). Protein lystates from 
HeLa-CEACAMI transfectants treated with N-glycanase followed by 
immunoprecipitation with 5F4 and the immune complex used as positive 
control (pos). c, mTIM-3 interacts with mCEACAM1 in mouse T cells. 
Splenocytes from Ceacam1-48"® Ceacam1 ‘~ and Ceacam1-4L™ Ceacam1 ‘~ 
mice cultured with anti-CD3 (1 yg ml’) or anti-CD3 (1 Lig ml’) and 
anti-CD28 (1 pg ml ') or medium for 96 h. Cell lysates immunoprecipitated 
with anti-mCEACAM1 antibody (ccl) or with mlgG and IB with 5D12 
(anti-mTIM-3 antibody) are shown. Locations of mTIM-3 protein variants are 
indicated. CHO, carbohydrate. d, Immunoprecipitation and immunoblot as in 
a with tunicamycin treated, wild-type HA-hTIM-3 and Flag-hCEACAM1 co- 
transfected HEK293T cells. Arrowhead denotes core CEACAM1 protein. 

e, Potential hCEACAM1-interacting residues on hTIM-3 highlighted in blue. 
f, HEK293 T cells transiently co-transfected with Flag-hCEACAM1 and 
HA-hTIM-3 mutants. Immunoblotting of anti-HA were used to analyse 
hTIM-3 expression in HEK293T transfectants. Except for Pro50Ala mutation 
displaying enhanced overall protein expression, all other mutations in the 
IgV domain of hTIM-3 are equally detected by anti-HA antibody. 


g, Quantification of association of hTIM-3 mutants associated with wild-type 
hCEACAM1 shown in Fig. 2c summing all experiments performed. 
Association between wild-type hCEACAM1 and hTIM-3 core protein are 
depicted as reference (set as 1, n = 3, mean + s.e.m. shown, unpaired Student’s 
t-test). h, Immunoprecipitation with anti-Flag (nCEACAM1) and immunoblot 
with anti-HA (hTIM-3) or anti-Flag of wild-type hCEACAM1 and 

mutant hTIM-3 proteins are shown. i, Quantification of h as performed in 

g. j, HEK293T cells co-transfected with Flag-hCEACAM1 wild-type and HA- 
hTIM-3 mutants and immunoprecipitation/immunblot as in h revealing no 
effects of Cys52Ala or Cys63Ala mutations in hTIM-3 in affecting association 
with hCEACAM1 in contrast to Cys109Ala mutation of hTIM-3 that disrupts 
interactions with hCEACAML. k, Potential hTIM-3-interacting-residues 
around the FG-CC’ cleft of hCEACAM1 highlighted in red. 1, HEK293T cells 
transiently co-transfected with Flag-hCEACAM1 mutants and wild-type 
HA-hTIM-3. Immunoblot with anti-Flag antibody was used to analyse 
hCEACAM1 expression in HEK293T co-transfectants. All hCEACAM1 
mutations in IgV domain equally detected. m, Densitometric quantification of 
IgV domain hCEACAM1 mutations associating with wild-type HA-hTIM-3 
described in Fig. 2d. n—p, Analysis of Gly47Ala mutation of hCEACAM1 in 
hTIM-3 co-transfected HEK293T cells by immunoprecipitation with anti-HA 
(hTIM-3) and immunoblot with anti-Flag (nCEACAM1) to detect 
association (n), IB with anti-Flag to confirm similarity of hCEACAM1 
transfection (0) and quantification of associated hCEACAM1 of nas shown in 
m. q-s, Analysis of hCEACAM1 mutants Asn42Ala and Arg43Ala association 
with hTIM-3 (q), similarity of transfections (r) and quantification of q as in 
n-p. Representative of four (d, h), three (f, g, i, 1-s), two (a-c) and one (j) 
independent experiments. *P < 0.05; **P < 0.01; ***P < .001. 
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Extended Data Figure 4 | Structural analysis of hCEACAM1 and hTIM-3 
protein interactions. a, Schematic diagram of single-chain construct 
consisting of hCEACAM1 IgV-domain (amino acids 1-107), a linker 
consisting of (GGGGS),4 and hTIM-3 IgV-domain (amino acids 1-105) and 
C-terminal hexahistidine tag. b-e, Surface plasmon resonance analyses 

of hCEACAM1-hTIM-3 single-chain interaction with GST-hTIM-3. 

b, Representative sensorgrams of serial dilutions of hCEACAM1-hTIM-3 
single chain flowed over immobilized GST-hTIM-3 or GST alone. 

c, Representative sensorgrams of 600 nM hCEACAM1-hTIM-3 single-chain 
flowed over immobilized GST-hTIM-3 in presence of various concentrations 
of blocking hTIM-3 specific peptide (amino acids 58-77) or control scrambled 
peptide. d, Representative sensorgrams as in b in presence of various 
concentrations of anti-hCEACAMI1 monoclonal antibody (26H7) or control 
isotype antibody (mIgG1, MOPC). e, Bar graphs represent resonance units 
upon equilibrium (RUgq) of above treatments with mean + s.e.m. shown from 
>three runs. GST-hTIM-3 immobilized by amine coupling. Dilutions of 


CEACAM1 


CEACAM1 
CEACAM1 


hCEACAM1-hTIM-3 single chain, hCEACAM1-hTIM-3 single chain with 
either blocking hTIM-3-specific peptide, control scrambled peptide, and 26H7 
antibody or control MOPC antibody were injected over immobilized 
GST-hTIM-3 at 25 °C. Flow rate was 25 pl min |. f, g, 2F, — F, maps 
contoured at 0.90 showing electron densities for X-ray crystal structure of 
single chain hCEACAM1-hTIM-3 (PDB code 4QYC). h, Summary of crucial 
amino acid residues defined biochemically and structurally. i-k, Similarity 
between apo-hCEACAM1 and hTIM-3-associated CEACAM1. Structure of 
CEACAM1 homodimer at 2.0 A resolution (PDB code 4QXW) (i). Homophilic 
“YQQN’ concavity indicated consisting of residues Tyr 34, Gln 44, Gln 89 

and Asn 97 at hCEACAM1 (IgV)-hCEACAM1 (IgV) interface (j). 
Superimposition of IgV domain of hCEACAM1 monomer (orange) from i 
on hCEACAM1 (green) from hCEACAM1-hTIM-3 heterodimer in 

Fig. 2e (k). Representative of three (b-e) independent experiments. 

***P < 0.001. 
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Extended Data Figure 5 | CEACAM1 determines TIM-3 expression and 
function. a, HEK293T cells transiently co-transfected with Flag-hCEACAM1 
and wild-type or mutants of HA-hTIM-3. Flow cytometry detecting 
HA-hTIM-3 (detected with anti-HA) and Flag-hCEACAM1 (detected with 
5F4) proteins at cell surface (top), Golgi apparatus (middle) or endoplasmic 
reticulum (bottom) using monensin and brefeldin A, respectively. b, Cellular 
distribution of wild-type or mutant hTIM-3 when co-expressed with wild-type 
hCEACAM1. Total counts of hTIM-3 at surface, Golgi apparatus and 
endoplasmic reticulum summed up to 100%. Depicted as percentage of 
hTIM-3. c, HEK293T cells transiently co-transfected with wild-type 
HA-hTIM-3 (detected with 2E2) and wild-type or mutant Flag-hCEACAM1 
(detected with anti-Flag). Flow cytometry analyses as in a. d, Cellular 
distribution of c, as in b. Depicted as percentage of hTIM-3. e, Immunoblot 
for wild-type or Thr101 Ie variant of hTIM-3 showing maturation status in 
presence of wild-type or mutated (Gln44Leu) hCEACAM1. f, Normal 
association of Thr101[e variant of hTIM-3 with hCEACAM1. g, Analysis of 
CD4* V88* T cells after SEB tolerance induction from experimental mice 
of indicated genotypes. h, Galectin-9 induction of apoptosis. Annexin 

V* propidium iodide staining of T,1 cells polarized from Tim3™® or 

Tim3"® Ceacam1~'~ mice after treatment with galectin-9 (2 1g ml’) for 8h. 
Note decreased apoptosis in Tim3"® Ceacam1 '~ T cells. i, Schematic diagram 
of protocol used for protein pull-down using in-column IgV domain of 


GST-hTIM-3 incubated with hCEACAM1 protein derived from transfected 
HEK293T cells as in Fig. 2m. j, GST or GST-hTIM-3 staining of 
hCEACAM1-4L-transfected Jurkat T cells. k, Wild-type CD4* T cells 
stimulated with anti-CD3 and/or anti-CD28 in the presence or absence of 
mCEACAM1 NF¢, or IgG1-Fc as control, and cells analysed for secretion of 
IFN-y and IL-2. |, m, Characterization of tolerance in SEB model. Tim3'8 

(I) and Tim3"8 Ceacam1~/~ (m) mice treated with SEB with schedule described 
in Extended Data Fig. le. Lymph node cells collected after SEB treatment 
and re-stimulated with soluble anti-CD3 at indicated doses and IL-2 measured 
by ELISA after 72 h. Note tolerance in Tim3”® but not Tim3"® Ceacam1 ‘~ 
mice. n = 3 per group. n, Anti-mTIM-3 blockade with 2C12 antibody of 
mCEACAMI NEF or control IgG-Fc staining of CD4* T cells from indicated 
genotypes expressed as levels relative to Ceacam1~’~ mice. 0, p, Analysis of 
mTIM-3 cytoplasmic tail function in transmitting mCEACAM1-induced 
signals. Activated mouse CD4* T cells from wild-type (0) or Ceacam1 ~~ 
(p) mice were retrovirally transduced, sorted and stimulated with anti-CD3 
with either human IgG-Fc (IgG, control) or mCEACAM1 N-terminal domain 
as NFc and TNF-o secretion assessed by ELISA after 72 h. Note ability of 
CEACAM1 N-terminal domain to transduce a signal associated with inhibition 
of TNF-« secretion in wild-type but not Ceacam1~'~ T cells. n = 3 per 
group. Data are mean ~ s.e.m. and represent three (f, g, k-p) and two 

(a-e, h, j) independent experiments. *P < 0.05; **P < 0.01; ***P < 0.001. 
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Extended Data Figure 6 | CEACAM1 and TIM-3 cooperatively regulate 
inflammation and anti-tumour immunity. a, Representative haematoxylin 
and eosin staining of groups described in Fig. 3e. Scale bar, 50 jum. b, Flow 
cytometry for intracellular cytokine assessment of TNF-o expression from 
infiltrating CD4* T cells from inflamed colonic lamina propria of 
Ceacam1’~ Rag2/ ~ recipients, 6 weeks after transfer with naive CD4~ 
CD44" CD62L" T cells from indicated genotypes. c, Anorectal prolapse of 
indicated genotypes. d, Representative haematoxylin and eosin staining of 
groups described in Fig. 3g. Scale bar, 50 jim. e, RNA expression defined by 
nanostring of lamina propria mononuclear cells in indicated groups (mean of 
n= 3 per group). f, Schematic overview of protocol for AOM/DSS colitis- 
associated cancer model. g, Representative haematoxylin and eosin staining of 
colon from wild-type mice in AOM/1.5% DSS model. Scale bar, 50 jim; 

h, Representative photograph of distal colons of wild-type mice (n = 3 per 


Tim-3"° Ceacam1* 


™ Polyp 
™ Polyp vinicity 
= MLN 


group, anorectal junction at left end) in AOM/1.5% DSS model. Vertical arrows 
show the sites for dissection of the polyps (black) and the vicinity of the polyps 
(red). i, Representative flow cytometry analyses on infiltrating lymphocytes 
of invading distal colonic polyps or from the vicinity of the polyps or from 
mesenteric lymph nodes for CD4* and CD8* T cells and expression of 
CEACAM1 and TIM-3 or PD-1 and TIM-3. Note that vicinity of polyps exhibit 
highest numbers of T cells with an exhausted phenotype. j, Summary of flow 
cytometry on infiltrating lymphocytes from invading distal colonic polyps 

or from vicinity of polyps and from mesenteric lymph nodes for CD4* and 
CD8" T cells expressing CEACAM1 and TIM-3 or PD-1 and TIM-3 (n = 3, 
median shown). k, Representative pathology in AOM/1.5% DSS model. Scale 
bar, 60 um. HGD, high grade dysplasia. Representative of three independent 
experiments (a-e, g-k). 
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Extended Data Figure 7 | Blockade of CEACAM1 and TIM-3 or genetic loss 
of CEACAM1 increases anti-tumour immunity. a, Schematic presentation of 
antibody blockade protocol described in Fig. 4g. b, Schematic presentation 

of antibody blockade protocol referred to in panel c. c, Prevention of CT26 
tumour growth with indicated combinations of antibodies as in (b) (n = 5 per 
group, post-hoc Dunnett’s correction followed by Friedman test). d, Schematic 
of schedule used for therapeutic antibody administration as described in e. 

e, Synergy of CEACAM1 and programmed death-ligand 1 (PD-L1) blockade in 
a therapeutic protocol as described in d was performed in wild-type BALB/c 
mice that received a subcutaneous inoculation of CT26 tumour cells. Mean 
tumour size (n = 5 per group, with linear regression analysis). Note synergistic 
increase in anti-tumour effect when CEACAM1 and PD-L1 co-blockade was 


performed. f, TILs were analysed for the relative proportion of CD4* T cells 
that produced IL-10 as in Fig. 4g (n = 4, unpaired Student's t-test with 
Mann-Whitney U correction). g, Percentages of CD8* T cells from spleen 
show that antibody treatments have no effects on total CD8* T cell numbers 
(n = 7/8, unpaired two-tailed t-test). h, Negative correlation of the numbers 
of AH1 tet” CD8" T cells and the size of tumours in the draining lymph nodes 
from the tumour-bearing mice in Fig. 4g (Pearson’s correlation coefficient, 
r= 0.9560, P = 0.044). i, Representative flow cytometry for tumour-specific 
(AH1-tetramer, tet’) CD8* T cells in draining lymph nodes of mice from the 
indicated genotypes. Data are mean + s.e.m. and represent three (f-i), two (e) 
and one (c) independent experiments. *P < 0.05; ***P < 0.001. 
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Extended Data Table 1 | Primers for site-directed mutagenesis of hTIM-3 and hCEACAM1 


Extended Data Table 1a. TIM-3 mutant primers for site-directed mutagenesis 


No. Mutant Name Forward primer sequence Reverse primer sequence 
1 PS0A gggaacctcgtegccetctgctess cccagcagacggccacgagettccce 
2 C5S2A ectegtgcecgtegectgegecaaagea tectttgecccagg cgacggecacgage 
3 C58R* (rs201 750016) tgggecaaageagcccetcctetgtttgaate cattcaaacacagg acgg ectecetttgeccca 
4 E62A gcctgtectgtgtttgcatgtggcaacgtgste caccacgttgecacatgcaaacacaggacagec 
5 C63A gcaccacgttg ccagcttcaaacacagg acagect agcctgtcctgtgtttgaagctgecaacetgetec 
6 R69A gtggcaacgtgstectcgcgactgatgaaaggeate catecctttcatcagtegcgagcaccacgttgecac 
7 T101I* (rs147827860) gtecctgaccatagagaatgtgattctagcagacag ctgtctgctagaatcacattctctatggtcagegac 
8 CLO9A ggatttggatccgecagecetagatcccactgtctg cagacagtgggatctacgectgecggatccaaatce 
gi CLIOA cagtgggatctactgcgcccggatccaaatccca tgggatttgs atccgg ecgcagtagatcccactg 
10 RILLW* ~~ (rs145478313) gggatctactgctgctggatccaaatcccag ctgggatttgeatccagcagcagtagatcce 
11 D120A tecaaatcccaggcataatgaatgctgaaaaatttaacctgaagtt aacttcaggttaaatttttcagcattcattatgcctgggatttes 


b. CEACAM] mutant primers for site-directed mutagenesis 


No. Mutant name Forward primer sequence Reverse primer sequence 
il Y34C* (rs147100915) ttttggctacagctg stgcaaag geeaaagagteg ecactctttecectttgeaccagetgtagccaaaa 
74 N42A aggggaaagagtegatescecccetcaaattgtageatat atatcctacaatttgacgeececcatccactetttecect 
3 R43A ggaaagagtggatgocaacgctcaaattgtageatatgca tgcatatcctacaatttgagegttgecatecactetttce 
4 Q44L* (rs200708090) gagtggatgecaaccetctaattgtage atatgcaata tattgcatatcctacaattagacggttgecatccactc 
5 G47TA ggcaaccetcaaattgtagcatatgcaataggaactcaa ttgagttcctattgcatatgctacaatttgacggttgcc 
6 Q89H* (rs8111468) cag gattctacaccctacatgtcataaagtcagatcttg caagatctgactttatgacatgtaggetetagaatcctg 


a, hTIM-3 primers. b, hCEACAM1 primers. Mutations were chosen on the basis of orthologous human residues reported previously’? in mTIM-3 as involved in non-galectin-9 interactions with an unknown ligand 
(hTIM-3 residues 1,4, 6, 10 and 11 in a); orthologous human residues in hCEACAM1 (hCEACAM1 residues 1, 4, 5 and 6 in b) predicted to be involved in mTIM-3 interactions on the basis of studies described in 
Extended Data Fig. 2g and Supplementary Information; natural allelic variants of amino acid residues in hTIM-3 (residues 3, 7 and 10 in a) and hCEACAM1 (1, 4 and 6 in b) described above or others (hTIM-3 
residue 7 in a), which were extracted from human exomic databases. Amino acid residues predicted to be involved in hTIM-3 (IgV)-hCEACAM 1 (IgV) interactions on the basis of X-ray crystallographic structural 
models (hCEACAM1 residues 2 and 3 in b) as described in Supplementary Information, and cysteine residues involved in intrachain disulphide bonds adjacent to the CC’ and FG loops of hTIM-3 (Cys 58-Cys 109, 
Cys 52-Cys 63 and Cys 38-Cys 110) as described in Supplementary Information. Asterisks represent annotated human natural single nucleotide polymorphisms. 
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Extended Data Table 2 


Crystal information, data collection and refinement parameters 


Data collection statistics 


Space Group 

Cell Dimensions (A) 
Resolution (A)* 

No. of measurements 
Unique reflections 
I/sigma I 
Completeness (%) 
Redundancy 

Rmerge (%) 


Structure refinement 


Rwork (%) 

Ryree (%) 

R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 

No. atoms 
Protein 
Ligand 
Water 

B Factors 

Protein 
Ligand 
Water 


PDB ID 


P4,2\2 

75.76, 75.76, 147.40 
36.85-3.40 (3.72-3.40) 
65773 

6397 

10.2 (6.5) 

99.9(100) 

10.3 (10.1) 

24.5 (49.9) 


34.5 
37.3 


0.014 
1.99 


3127 
19 
21.20 
30.0 


4QYC 


*Highest resolution shell is shown in parenthesis. 


hCEACAM1:hTIM-3 single chain CEACAM1 (Ig-V) 


P42,2 
107.16, 107.16, 61.60 
47.92-2.04 (2.11-2.03) 
219575 

23421 

8.8 (2.3) 

99.9 (100) 

9.4 (9.0) 

18.9 (98.4) 


20.3 
24.2 


0.014 
1.23 


1680 
47 
66 


23.30 
38.40 
28.60 


4QXW 
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Extended Data Table 3 | Genotype analysis for human T/M3 alleles in inflammatory bowel disease 


a. Study numbers 


c. Clinical data. 


Source ee Genoytpe distribution Clinical information 1 Basen 2 3 
German IBD patients 5,598 AA=3/AG=77/GG=5,518 Diagnosis CD cD UC 
German CD patients 3,975 AA=2/AG=54/GG=3,919 Sex male female male 
German UC patients 1,623 AA=1/AG=23/GG=1,599 Age of onset 33 13 40 
German healthy controls 3,928 AA=0/AG=50/GG=3,878 Disease localisation Colon Tleum, colon, rectum Colon 
EVS, European Americans 4,300 AA=0/A G=36/GG=4,264 


b. Studied 71M3 SNPs 


Disease characteristics 


Extraintestinal manifestations 


Treatment 


Surgery 


Other 


Snail-track ulcers, anal 
Entero-cutaneous fistula and stenosis, granulomas, 


: : . Perianal fistula 
colonic stenosis perianal abscesses and 


fistulas 
Pancreatitis Eye, joints, skin Skin 
Azathioprine, ASA, steroids Azathioprine, unknown 
corticosteroids 
Colostomy at age 44, two 
large bowel surgeries inthe Surgery at ages 19 and 21 unknown 
following 3 years 
Chronic active CD, 
Iron and selenium deficincy, refractory to Family history of 
hyperuricemia, depression immunosuppression, IBD 


hypertension 


CD patients 


UC patients 


IBD patients 


Healthy controls 


rs-number AA AB BB AA AB BB AA AB BB AA AB BB 
18201054625 2329 0 0 1448 0 0 a7TT 0 0 2913 0 0 
18145478313 2302 2 0 1445 0 0 3747 2 0 2875 3 0 
18190484372 2318 0 0 1439 0 0 S757 0 0 2894 0 0 
18147827860 2284 42 2 1423 22 1 3707 64 3 2879 32 0 
1835960726 2305 24 0 1425 22, 0 3730 46 0 2861 51 0 
1181855375 2308 0 0 1442 0 0 3750 0 0 2896 0 0 
1s184868814 2328 1 0 447 0 0 3775 1 0 2912 1 0 
18201750016 2275 0 0 433 0 0 3708 0 0 2839 0 0 
1841283181 2314 1 0 444 0 0 3758 1 0 2894 0 0 
15147605860 2329 0 0 447 0 0 3776 0 0 2913 0 0 
1s142180056 2329 0 0 1448 0 0 3777 0 0 2913 0 0 


Genotype distribution for 11 TIM3 variants in CD, UC and IBD patients and healthy controls. A specifies the major, B the minor allele. 


a, Number of successfully genotyped German patients and contro! 


s and corresponding distribution of genotypes for the Thr101Ile missense variant in T/M3, including the genotype distribution in individuals of 


European descent (European Americans) downloaded from the Exome Variant Server (EVS), NHLBI GO Exome Sequencing Project (http://evs.gs.washington.edu/EVS/). Genotyping 11 rare variants in T/M3, ina 


cohort of 5,634 inflammatory bowel disease and 3,940 control su 


jects, we observed homozygous rs147827860 (Thr101lle) carriage exclusively in three IBD subjects (2 Crohn’s disease (CD) and 1 ulcerative 


colitis (UC)), but no control subjects which significantly deviated from Hardy-Weinberg equilibrium in IBD cases (P = 0.0033) but not controls (P = 1.00). b, Description of TIM3 single nucleotide polymorphisms 


(SNPs) analysed in cohorts described in a. c, Clinical data for the 


hree patients found to be homozygous for the rs147827860 (Thr101 Ile) variant. 
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An ERK/CdkS5 axis controls the diabetogenic actions 


of PPARy 


Alexander S. Banks!, Fiona E. McAllister”, Joo Paulo G. Camporez?, Peter-James H. Zushin', Michael J. Jurczak?, 
Dina Laznik-Bogoslavski*, Gerald I. Shulman’, Steven P. Gygi? & Bruce M. Spiegelman*"* 


Obesity-linked insulin resistance is a major precursor to the develop- 
ment of type2 diabetes. Previous work has shown that phosphor- 
ylation of PPARy (peroxisome proliferator-activated receptor y) at 
serine 273 by cyclin-dependent kinase 5 (Cdk5) stimulates diabeto- 
genic gene expression in adipose tissues’. Inhibition of this modifi- 
cation is a key therapeutic mechanism for anti-diabetic drugs that 
bind PPARy, such as the thiazolidinediones and PPARy partial 
agonists or non-agonists”. For a better understanding of the import- 
ance of this obesity-linked PPARy phosphorylation, we created mice 
that ablated Cdk5 specifically in adipose tissues. These mice have 
both a paradoxical increase in PPARy phosphorylation at serine 273 
and worsened insulin resistance. Unbiased proteomic studies show 
that extracellular signal-regulated kinase (ERK) kinases are activated 
in these knockout animals. Here we show that ERK directly phos- 
phorylates serine 273 of PPARy in a robust manner and that Cdk5 
suppresses ERKs through direct action on a novel site in MAP kinase/ 
ERK kinase (MEK). Importantly, pharmacological inhibition of MEK 
and ERK markedly improves insulin resistance in both obese wild- 
type and ob/ob mice, and also completely reverses the deleterious 
effects of the Cdk5 ablation. These data show that an ERK/Cdk5 axis 
controls PPARy function and suggest that MEK/ERK inhibitors may 
hold promise for the treatment of type 2 diabetes. 

Obesity is characterized by dysfunctional adipose tissues in which 
failure to store excess energy appropriately leads to ectopic lipid depos- 
ition, progressive insulin resistance and heightened risk for type 2 diabetes. 
Disordered secretion of certain fat-derived hormones, called adipokines, 
also contributes to the metabolic dysfunction in obesity and diabetes. 
Adipose-tissue-directed insulin-sensitizing drugs, including the thia- 
zolidinediones, potently improve whole-body sensitivity to insulin’. The 
thiazolidinedione drugs have two distinct functions as ligands for PPARy: 
they promote the differentiation of preadipocytes** and they block the 
phosphorylation of PPARy at $273 (ref. 1). We recently demonstrated 
that non-agonist PPARy ligands capable of blocking the phosphoryla- 
tion of PPARy at S273 retain potent anti-diabetic effects despite the inab- 
ility to promote adipogenesis’. These findings strongly suggested that 
obesity-mediated phosphorylation of PPARy S273 may not only correl- 
ate positively with the development of insulin resistance but may also 
be causal to this state. 

A variety of protein kinases participate in insulin action and insulin 
resistance. Insulin signalling activates the Akt/phosphoinositide 3-kinase 
(PI(3)K) and Grb2/Ras/MEK/ERK kinase cascades®*’. Although much 
is known about the role of the former in promoting the canonical ana- 
bolic actions of insulin, studies in vitro had suggested that the latter 
cascade downstream of insulin signalling could contribute to insulin 
resistance®”, although there is controversy on this point’®. Obese rodents 
were shown to have elevated ERK activity, whereas mice lacking ERK1 
were shown to be more sensitive to the effects of insulin®’”””. 

Cdk5 function is both necessary and sufficient in cultured adipo- 
cytes to phosphorylate PPARy at S273 (ref. 1). Mice with global or 


brain-restricted deletion of Cdk5 show increased perinatal mortality 
due to a defect in neurogenesis. We therefore set out to test whether 
modulation of PPARy phosphorylation at $273 in adipose tissues 
would lead to altered insulin sensitivity in vivo by creating adipose- 
selective Cdk5-deficient mice (Cdk5-KO)'*"*. In contrast to global 
knockouts!*"°, Cdk5-KO mice are grossly normal in appearance, with 
no apparent differences in body weight or fasting glucose levels when 
maintained ona standard diet (Extended Data Fig. 1). Deletion of Cdk5 
in whole white adipose tissue was confirmed by both western blot ana- 
lysis (Fig. la) and quantitative real-time PCR (Fig. 1b). To determine 
whether the residual Cdk5 expression in the knockout (KO) mice was 
emanating from non-adipocytes or from incomplete recombination, tis- 
sue fractionation was performed; no detectable Cdk5 protein was observed 
in the floating adipocyte fraction, whereas residual signal was observed 
in the stromal vascular fraction (Fig. 1c). Ona standard chow diet, KO 
mice were normal, healthy and indistinguishable from Cdk5"'*/"!* 
controls (Extended Data Fig. 1). 

Both PPARy S273 phosphorylation and insulin resistance are strongly 
promoted by obesity and inflammatory cytokines’. When maintained on 
a high-fat diet to induce obesity, no differences were observed between 
wild-type (WT) and KO groups in food intake, energy expenditure or 
body weight (Fig. 1d and Extended Data Fig. 2). Paradoxically, meta- 
bolic analyses of these Cdk5-KO mice demonstrated that their glucose 
homeostasis was impaired in comparison with control animals. Cdk5- 
KO mice had elevated fasting insulin levels, as well as impairment in insulin 
tolerance, with a trend towards impaired glucose tolerance (Fig.1e-h). 
Wealso observed a paradoxical increase in $273 PPARy phosphoryla- 
tion in obese Cdk5-KO mice, strongly suggesting compensation from 
an alternative protein kinase (Fig. 1i, j). 

To understand how PPARy S273 phosphorylation is increased in the 
absence of Cdk5, unbiased quantitative proteomic kinase profiling was 
performed on white adipose tissue (Fig. 2a). The most enriched protein 
kinase-derived peptide in KO mice (VADPDHDHTGFLTEpY *’VATR) 
corresponded to the activation loop of MAP kinase, ERK2/Mapk1 (Fig. 2b). 
We independently confirmed that ERK2 was activated in adipose tis- 
sue extracts from the KO mice by examining the phosphorylation of 
ERK2 at T183 and Y185, using phospho-specific antibodies against 
ERKs (Fig. 2c). We found no significant differences in the activation of 
alternative obesity-linked kinases (Extended Data Fig. 3). In addition, 
elevated T183/Y185 ERK phosphorylation was observed in adipocytes 
from Cdk5-KO mice that were differentiated for 8 days in vitro, suggest- 
ing that this phenomenon is cell-autonomous (Fig. 2d). A small-molecule 
inhibitor of Cdk5, roscovitine’’, also promotes ERK phosphorylation 
in cultured adipocytes, although here the competing inhibition of Cdk5 
and activation of ERK had a net effect of leaving PPARy S273 phos- 
phorylation unchanged (Fig. 2e). 

Because even the best available inhibitors of Cdk5, such as roscovitine, 
are not completely specific for Cdk5 (refs 17, 18), we also used a more 
precise means of regulating this kinase: an allele of Cdk5 specifically 


1Division of Endocrinology, Diabetes and Hypertension, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts 02115, USA. @Department of Cell Biology, Harvard Medical 
School, Boston, Massachusetts 02115, USA. Yale Mouse Metabolic Phenotyping Center and Departments of Internal Medicine and Cellular and Molecular Physiology, Yale University School of Medicine, 
New Haven, Connecticut 06510, USA. “Dana-Farber Cancer Institute, Boston, Massachusetts 02115, USA. 
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Figure 1 | Insulin resistance after Cdk5 deletion in adipocytes. a, b, Deletion 
of Cdk5 in epididymal white adipose tissue from control (Cdk5"*"™) or 
adipocyte-specific knockout, KO (Cdk5""'™*::adiponectin-Cre) was 
confirmed by western blotting (a; n = 4) or quantitative real-time PCR 

(b; n = 5). Ctl, control. c, Fractionated adipose tissue confirmed deletion was 
confined to the adipocyte fraction of adipose tissue. d, Body weight of control or 
KO mice when maintained on a high-fat diet. n = 20 Ctl, 25 KO. e, f, Fasting 
glucose (e) and fasting insulin (f) in mice maintained on a high-fat diet. 

n= 10 (control) and 12 (KO). g, h, Insulin tolerance test (g) and glucose 
tolerance tests (h) are consistent with impaired insulin sensitivity. n = 15 
(control) and 17 (KO). Histograms show the areas under the curves. i, Western 
blots of white adipose tissue for pS273 PPARy in control and KO mice. 

j, Quantification of i. *P < 0.05; **P < 0.01; ***P < 0.001; n.s., not significant. 
Error bars indicate s.e.m. 


designed to be inhibited by the ‘bulky’ small molecule INMPP1. With 
the use of this previously validated approach’’”°, a dose-dependent 
increase of ERK phosphorylation was observed after Cdk5 inhibition 
with INMPP1, in both fibroblasts and cultured adipocytes (Fig. 2f, g). 
Thus, we observed elevated levels of an activating ERK phosphoryla- 
tion as a consequence of Cdk5 loss in vivo or ex vivo. 

We next investigated whether ERK kinase might be capable of directly 
compensating for Cdk5 deficiency in the phosphorylation of PPARy. 
ERK and Cdk5 are structurally similar Ser/Thr kinases with a propensity 
for phosphorylating sites with proline in the +1 position”’. In cultured 
cells, both constitutively active ERK (ERK-CA) and active Cdk5 (Cdk5 
and its activating subunit p35) phosphorylated PPARy at S273 (Fig. 2h). 
We confirmed that this was a direct effect by performing in vitro protein 
kinase assays on recombinant full-length PPARy. Both ERK2 and Cdk5 
resulted in the direct phosphorylation of S273 PPARy, but a third kinase, 
MEKz2, failed to phosphorylate this site (Fig. 2i). Cdk5 and ERK both 
phosphorylated the Ser-Pro sites $273 and $112 (Fig. 2i). The action of 
ERK on S112 has been reported previously by us and others”. A novel 
Cdk5-specific target site at T296 was also identified. In contrast, both 
ERK and MEK phosphorylated PPAR; at only one common site, $133. 
Because thiazolidinediones can block the ability of CDK5 to phosphor- 
ylate S273 PPARy (ref. 1), we sought to determine whether they would 
similarly block the action of ERKs. ERK phosphorylates both $112 and 
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Figure 2 | Identification and characterization of ERK as a S273 PPARy 
kinase. a, ATP-probe-enriched phosphoproteomic analysis of kinases in 
control or Cdk5-KO adipose tissue from mice fed on a high-fat diet. Heat 
map indicating the most highly regulated phosphopeptide between control and 
Cdk5-KO (n = 3 per group; P = 0.08) corresponding to the peptides containing 
the activation loop of Mapk1/ERK2. b, Normalized quantification of the 
abundance of the phosphopeptide (left) and total abundance (right) 
corresponding to the activation loop of ERK2, determined by mass 
spectrometry. c, Western blot of ERK1 and ERK2 phosphorylation in brown 
adipose tissue from mice on a high-fat diet. d, Western blot of phospho-ERK1/2 
in primary adipocytes differentiated in vitro for 8 days and serum starved for 
18h. e, Inhibition of Cdk5 by treatment with roscovitine for 6h in cultured 
F442A adipocytes at the indicated doses. f, HEK 293 cells expressing WT 
Cdk5 or an analogue-sensitive (AS) mutant of Cdk5 were treated with the 
AS-specific inhibitor INMPP1 at 0, 0.1, 1.0 or 10 1M for 2h. g, Cultured 
adipocytes stably expressing Cdk5-AS treated with the indicated doses of 
1NMPP1. h, PPARy in HEK 293 cells co-transfected with increasing doses of 
constitutively active ERK2 kinase (ERK-CA) or active Cdk5 (Cdk5 with p35). 
Western blotting was performed for both pS273 PPARy and total PPARy. 

i, Phosphorylated residues identified by LC-MS/MS after in vitro kinase assay 
of recombinant Cdk5, ERK2, or MEK2 incubated with full-length recombinant 
PPARY. j, In vitro ERK kinase assay with incubated with full-length 
recombinant PPARy and increasing doses of pioglitazone before western 
blotting for pS273 and total PPARy. NT, no treatment. Error bars 

indicate s.e.m. 


S273, but increasing concentrations of pioglitazone block phosphor- 
ylation only at S273 (Fig. 2)). 

To determine how Cdk5 might be regulating ERK, we again turned 
to ATP probes and quantitative proteomics to identify Cdk5 substrates. 
Cdk5-deficient adipose tissue extracts were spiked with increasing doses 
of recombinant active Cdk5 kinase, plus p35 (Fig. 3a). MEK2, the kinase 
upstream of ERK, was identified as the protein with the single greatest 
dose-dependent increase in phosphorylation (Fig. 3b). The phospho- 
peptide identified contained two closely spaced potential phosphothreo- 
nine sites at T395 and T397 of mouse MEK2. Although Cdk5 has been 
reported to regulate MEK] at T286, this site was not conserved in MEK2 
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Figure 3 | Regulation of MEK2 by Cdk5. a, ATP-probe phosphoproteomic 
analysis of activated kinases in Cdk5-KO adipose tissue lysates after addition of 
recombinant active Cdk5 and p35 protein at the indicated doses. b, The 
most highly regulated peptide includes MEK2 residues T395 and T397. 

c, HEK 293 cells expressing the AS mutant of Cdk5 were treated with the 
AS-specific inhibitor INMPP1 at 0, 0.1 1.0 or 10 11M for 2h before western 
blotting for MEK2 phospho-T395. d, Immunoprecipitation in vitro kinase 
assay of MEK2-WT or T395A T397 (MEK2-AA) mutant. Flag-tagged 
WT-MEK2 or constitutively active CA-MEK2 (S222D S226D) were 
immunoprecipitated from HEK 293 cells and incubated with ATP and 
recombinant ERK protein. 


(Extended Data Fig. 4)°°. The function of these two MEK2 sites (T395 
and 1397) has not previously been reported, yet they are the most 
frequently found MEK2 modifications in the proteomic databases out- 
side the canonical activation loop”’. 

Using a phospho-specific antibody, we were able to show that acute 
Cdk5 inhibition abrogates the phosphorylation of MEK2 at T395 (Fig. 3c). 
Mutation of both of these neighbouring sites rendered MEK more active 
than wild type in a protein kinase assay using recombinant ERK protein 
as a substrate (Fig. 3d). Together, these findings strongly suggest that 
Cdk5 deletion results in the activation of ERK kinase via derepression 
of MEK kinase activity. 

We next asked a critical question: does inhibition of the MEK/ERK 
pathway correct the metabolic defect evident in the Cdk5-KO mice? 
Mice of both WT and KO genotypes maintained on a high-fat diet were 
treated with the well-characterized MEK inhibitor PD0325901 for 5 days 
before a glucose tolerance test. Inhibition of MEK was able to normalize 
glucose tolerance completely in these two groups, consistent with the 
role of ERK as a key compensating kinase in Cdk5-deficient adipose 
tissue (Fig. 4a—c). This occurred with no effect on adiposity (Extended 
Data Fig. 5). 

To gain a better understanding of the role of ERK in the pathophys- 
iology of diabetes we examined insulin sensitivity by performing insulin 
tolerance tests and hyperinsulinaemic-euglycaemic clamp experiments 
on diet-induced obese wild-type C57Bl/6 mice. Mice treated with 
PD0325901 showed a late divergence in glycaemia 90 min after admin- 
istration of insulin, suggesting an exaggerated response to insulin action 
(Fig. 4d). Similarly, clamp studies revealed a twofold increase in the glu- 
cose infusion rate (Fig. 4e). This markedly improved sensitivity to insu- 
lin infusion was due to increased whole-body glucose utilization and 
improved insulin-mediated suppression of both endogenous glucose 
production and lipolysis (Fig. 4f, i, j). Tracer analysis indicated that 
insulin sensitivity was increased more strongly in adipose tissue than in 
skeletal muscle; however, both tissues contributed to the effects of ERK 
inhibition on glucose disposal (Fig. 4g, h). 

To examine further the efficacy of targeting MEK and ERK pharma- 
cologically, we studied severely insulin-resistant leptin-deficient obese 
(ob/ob) mice. We found elevated phospho-ERK levels in white adipose 
tissue from unchallenged ob/ob mice in comparison with lean WT controls 
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Figure 4 | Metabolic consequences of MEK inhibition in vivo in mice fed 
with a high-fat diet. a, b, Glucose tolerance tests of high-fat-diet-fed adipose- 
specific Cdk5-KO mice or controls after treatment with vehicle (a) or MEK 
inhibitor PD0325901 (PD) (b), c, Area under the curves in a and b. n = 10 
for Ctl vehicle, KO vehicle and KO PD groups; n = 9 for the Ctl PD group. 
d, Insulin tolerance tests (n = 12 Ctl, 11 KO) in high-fat-diet-fed wild-type 
C57B1/6 mice after treatment with PD0325901. e-j, Hyperinsulinaemic- 
euglycaemic clamps in high-fat-diet-fed wild-type C57BI/6 mice after 
treatment with PD0325901. e, Glucose infusion rate. Veh., vehicle. f, Whole- 
body glucose uptake/disposal. g, *H-2-deoxyglucose (2DG) tracer uptake into 
epididymal white adipose tissue. h, 2DG tracer uptake into gastrocnemius. 

i, Percentage suppression of endogenous glucose production. j, Percentage 
suppression of free fatty acids. n = 9 vehicle, 11 PD. *P< 0.05; **P<0.01; 
***P < 0.001. Error bars indicate s.e.m. 


(Fig. 5a). We next treated ob/ob mice with either PD0325901 or a distinct 
US Food and Drug Administration-approved MEK inhibitor, Trametinib/ 
GSK1120212. In comparison with control animals, mice receiving either 
GSK1120212 or PD0325901 showed an improvement in glucose tol- 
erance (Fig. 5b and Extended Data Fig. 6a). This was accompanied by 
decreased insulin levels and increased levels of the insulin-sensitizing 
hormone adiponectin (Fig. 5c, d and Extended Data Fig. 6b) without 
affecting body weight (Fig. 5e and Extended Data Fig. 6c). 

We performed gene expression analysis to gain a better understand- 
ing of the transcriptional basis of the improved glucose homeostasis 
after MEK inhibition. We have previously defined a set of 17 genes that 
are sensitive to PPARy S273 phosphorylation in cultured adipose cells 
and then further refined that gene set to 10 genes that were also regu- 
lated by obesity in mice and responded to treatment with non-agonist 
PPARy ligands’. Although treatment with MEK inhibitors did not affect 
the degree of obesity, expression of all of these 10 genes was significantly 
regulated by treatment with one or both of the MEK inhibitors used in 
this study (Fig. 5f). This included the genes encoding the circulating 
insulin sensitivity factors adiponectin and adipsin, previously shown to 
be most sensitive to phosphorylation of PPARy at S273. The direction 
of all of these changes was consistent with that predicted by the reversal 
of $273 phosphorylation. Conversely, the MEK inhibitors had a min- 
imal effect on the expression of genes linked to PPARy agonism and 
adipogenesis, including aP2 and Cebpa (Fig. 5g). We also found increased 
expression of genes participating in the induction of thermogenesis and 
‘browning of white adipose tissue (Fig. 5h). This thermogenic program 
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does not include induction of the PPARy coactivators Pgcla or Prdm16, 
although post-translational modifications of these proteins cannot be 
excluded as contributing to this phenotype”. Last, we observed decreased 
expression of the pro-inflammatory cytokine Tnfa messenger RNA and 
an altered adipose tissue macrophage expression profile (Extended 
Data Fig. 7). Both MEK inhibitors caused a decrease in PPARy phos- 
phorylation at S112 and S273, confirming the established role of ERKs 
in regulating $112 and strongly suggesting a new role in regulating $273 
(refs 22-24) (Fig. 5i). Taken together, these data identify an insulin- 
sensitizing role for MEK inhibitors in adipose tissue, which is consist- 
ent with the effects of non-agonist PPARy ligands that specifically block 
PPARy phosphorylation at $273 (Extended Data Fig. 8). 

The MEK inhibitory compounds that we have used here are now 
safe and effective enough to be used in patients with metastatic mela- 
noma” and are tolerated well enough to permit studies of metabolism 
in rodents and perhaps in humans. We find anti-diabetic effects when 
these MEK inhibitors are given at doses threefold lower than were used 
in rodent tumour xenograft models”’. This suggests that there may be a 
therapeutic window for improving insulin sensitivity via PPARy, using 
either a safe, low-dose treatment of a MEK inhibitor, a non-agonist 
ligand that blocks kinase accessibility to PPARy S273, or both together. 
These data offer hope for resurrecting PPARy-targeted therapeutics to 
improve whole-body insulin sensitivity. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Animal experiments. Animal experiments were performed with approval from 
the Institutional Animal Care and Use Committees of both Beth Israel Deaconess 
Medical Center and The Harvard Center for Comparative Medicine. Glucose and 
insulin tolerance tests were performed as described previously by researchers blinded 
to the genotype and treatment group to which each mouse belonged’’. Glucose 
doses used were as follows: for mice maintained ona standard-diet, 2 g kg” '; high- 
fat diet, 1.5 gkg_'; ob/ob mice, 1 g kg *. Male C57B16/J ob/ob mice were purchased 
from Jackson Labs at 5-6 weeks of age and allowed to acclimate for 1-2 weeks 
before treatment. Cdk5"'’" mice were provided by P. Greengard (Rockefeller 
University). Cdk5"“"* control, adiponectin-Cre and the Cdk5-KO (Cdks "°F: 
adiponectin-Cre) mice were healthy and viable, in contrast to mice with whole- 
body deletion of CDK5. Adiponectin-Cre mice were provided by E. Rosen (Beth 
Israel Deaconess Medical Center). Both strains were previously backcrossed to a 
C57B1/6 background. For diet-induced obesity, male animals were fed on a high-fat 
(60%) diet (Research Diets, catalogue no. D12492i). For in vivo therapeutic assays, 
GSK1120212 (3 mgkg” ') or PD0325901 (refs 32-34) (10 mg kg” ') (Selleckchem) 
was administered by daily oral gavage for 5 days unless otherwise specified. Com- 
pounds were dissolved in dimethylsulphoxide and diluted into an aqueous 250-l 
dose containing 0.5% hypromellose and 2% Tween-80. High-fat-diet mice were 
switched to and maintained on a standard chow diet 48 h before the first dose of 
MEK inhibitors. Sample sizes were based not on power calculations but on the 
maximum number of mice that could be bred to within 2-3 weeks in age to main- 
tain well-matched controls. For randomization of groups, mice were ranked accord- 
ing to body weight and alternated between vehicle and drug treatment groups. Mice 
were excluded from randomization if body weight was more than two standard 
deviations from the mean. Hyperinsulinaemic-euglycaemic clamp studies were per- 
formed as described previously’. 

Cell culture. Cell culture of HEK 293 and F442A pre-adipocytes was performed as 
described: protein and RNA preparation, western blotting, quantitative real-time 
PCR with expression normalized to levels of TATA-binding protein (TBP). Cell lines 
were found free of mycoplasma before initiation of studies. Inhibition of analogue- 
sensitive CDK5 kinase (F80G) was performed by treating cells with INMPP1 (1- 
(1,1-dimethylethyl)-3-(1-naphthalenylmethyl)-1H-pyrazolo[3,4-d]pyrimidin-4- 
amine) (Cayman Chemical) at the indicated doses for 2h. 

Plasmids. The constitutively active ERK kinase, ERK-CA, is the product of a fusion 
of MEK1 with ERK2 (ERK2-MEK1-LA) and was provided by M. Cobb”. The inserts 
encoding human WT-MEK2 and CA-MEK2 (S222D and S226D) from Addgene 
(catalogue nos 40776 and 29580) were cloned into Flag-pCDNA3.1. The F80G 
mutation of haemagglutinin-tagged Cdk5 was introduced using the Quikchange 
XL Site Directed Mutagenesis Kit (Agilent). This insert was cloned into the pMSCV 
backbone retrovirus for stable infection of F442A pre-adipocytes. 

Antibodies. Antibodies were obtained from Cell Signaling Technology (anti- 
phospho-ERK1/2 (catalogue no. 9101), anti-ERK (catalogue no. 4695), anti-phospho 
p38 (catalogue no. 4511), anti-p38 (catalogue no. 9212), anti-phospho JNK (cata- 
logue no. 4671), anti-JNK (catalogue no. 9252), anti-phospho AKT (catalogue no. 
13038) and total AKT (catalogue no. 9272)), Millipore (anti-phospho-S112 PPARy 
(catalogue no. 04-816)) and Santa Cruz Biotechnology (anti-PPARy (catalogue no. 
sc-7273) and anti-phospho-394-MEK2 (catalogue no. sc-101734)). Antibodies 
against anti-phospho-S273 PPARy were generated as described previously’. 
Indirect calorimetry. Energy expenditure, O, consumption, CO, production, 
respiratory exchange ratio, total locomotor activity and food intake measurements 
were made with a 16-cage Columbus Instruments Oxymax Comprehensive Lab 
Animal Monitoring System at ambient room temperature (21-23 °C). Whole- 
body composition was assessed with an EchoMRI 3-in-1 on conscious mice both 
before and after calorimetry. Because body weight and body composition were 
unchanged between WT and Cdk5-KO mice, data were analysed by analysis of 
variance (ANOVA). 

Mass spectrometry. Enrichment using ActivX ATP probes (Thermo) combined 
with phosphopeptide enrichment were used to profile kinases**. In brief, tissue 
extracts were incubated in the presence of non-hydrolysable ATP analogues coupled 
to a desthiobiotinylated tag. These small-molecule probes are designed to covalently 
attach to ATPases, including protein kinases. Peptides were then labelled with 
TMT isobaric tags and the resulting mass spectra were analysed quantitatively. 
The method was recently adapted to include an additional phosphopeptide enrich- 
ment step, thereby improving the identification and quantification of protein 
kinase activities”. The methods are similar to those described in ref. 37. Mouse 
tissue homogenate from three WT and three Cdk5-KO mice was subjected to gel 
filtration in spin columns (Zeba; Pierce) in accordance with the manufacturer’s 
instructions to remove endogenous ATP, ADP and small molecules and then diluted 
with reaction buffer (25mM Tris-HCl pH 7.4, 150mM NaCl, 1mM EDTA, 1% 
Nonidet P40, 5% glycerol) to a final protein concentration of 2 mg ml’. Protease 
inhibitors (1 X ‘complete’; Roche 04693132001) and phosphatase inhibitors (final 


concentration 2 mM imidazole, 1 mM sodium fluoride, 1.15 mM sodium molyb- 
date, 4mM sodium tartrate dehydrate, 1 mM f-glycerophosphate, 50 uM pheny- 
larsine) were added along with MnCl, to a final concentration of 10 1M. Lysates 
were incubated with ActivX ATP probes for 10min at room temperature at a 
concentration of 20 11M. The reaction was quenched with 8 M urea, reduced with 
dithiothreitol (5 mM final concentration) and then alkylated with iodoacetamide 
(15 mM final concentration). The solution was then subjected again to gel filtration 
(Zeba; Pierce). Streptavidin was then added to the lysate to capture the undigested, 
desthiobiotinylated proteins and kinases. After extensive washing (lysis buffer 
containing 6 M urea (five times with 100 il), then 50 mM HEPES (five times with 
100 pl)), the captured proteins were subjected to on-bead digestion with trypsin 
(5 ng pl” ') for 4h at 37°C ina tandem-mass-tag-compatible buffer (50 mM 
HEPES pH 7.4, 0.5 M guanidinium chloride). After digestion, the resulting pep- 
tides were extracted and the beads were washed with 50 mM HEPES (twice with 
50 ll) and these washes were added to the peptide mixture. Acetonitrile was added 
to the peptide mixture to a final concentration of 30%, and the peptides were 
subjected to Tandem mass tags (TMT; Thermo Scientific) labelling. For labelling, 
0.8 mg of each TMT reagent (126, 127, 128, 129, 130 and 131) was resuspended in 
40 pl of anhydrous acetonitrile. Peptides were resuspended in 17.5 ul of 50 mM 
HEPES pH 8.5 and 5 ul of acetonitrile, to which 2.5 jl TMT reagent was added. The 
TMT labelling reaction was performed at room temperature for 1 h, and individual 
labelling reactions were quenched by the addition of 3 pl of 5% hydroxylamine. 
The six samples were then combined and desalted with StageTips”*. 

For phosphopeptide enrichment, peptides were resuspended in 100 pl of bind- 
ing buffer (2 M lactic acid, 50% acetonitrile). Phosphopeptides were enriched with 
TiO, as described’. TiO, resin (600 1g; GL Sciences) was prepared by washing 
twice with 200 pl of binding buffer and was then added to the peptides in binding 
buffer and incubated for 1h at room temperature. After incubation, beads were 
recovered by centrifugation (4 min at 200g) and washed with binding buffer (five 
times with 200 jl). Bound phosphopeptides were then eluted with 50 mM K,HPO, 
pH 10 (three times with 20 pl) and further purified with StageTip. The purified 
phosphopeptides were resuspended in 8 1l of 5% formic acid, and 4 il was injected 
and analysed by LC-MS3. 

In the Cdk5 experiment, tissue homogenate was pretreated with Cdk5/p35 
(Millipore) for 10 min with the indicated amounts before being processed with 
the above protocol. ActivX ATP probes, high-capacity binding streptavidin and 
TMT were obtained from Thermo Scientific; modified trypsin was obtained from 
Promega; and TiO, beads were obtained from GL Sciences. SepPak Cg solid-phase 
extraction cartridges were purchased from Waters Corporation. 

Liquid chromatography (LC) and mass spectrometry (MS) analysis. LC-MS/ 
MS analysis was performed on an LTQ Orbitrap Velos or an LTQ Orbitrap Elite 
mass spectrometer (Thermo-Fisher Scientific) linked to an Accela 600 quaternary 
LC pump (Thermo) and a Famos autosampler (LC Packings). Flow rates of 300 nl 
min over the column were achieved by using a flow-split method. A hand-pulled 
fused silica microcapillary column (125 um X 18 cm) was used for peptide sepa- 
ration. The column was first packed with about 0.5 cm of Magic C4 resin (particle 
size 5 [um, pore size 100 A; Michrom Bioresources) and then with 18 cm of Maccel 
Cig AQ resin (particle size 3 lum, pore size 200 A; Nest Group). The total LC-MS 
run length for each sample was 180 min and consisted of a 150-min gradient from 
3% to 33% acetonitrile in 0.125% formic acid. A recently developed MS3 method 
was used to overcome the interference problem in the acquisition of TMT data*®. 
In brief, a high-resolution MS1 scan in the Orbitrap (300-1500m/z, 60k resolution; 
automatic gain control (AGC) 10°) was collected from which the top ten precursors 
were selected for MS2 analysis followed by MS3 analysis. The MS2 scan was per- 
formed in the quadrupole ion trap (collision-induced dissociation, AGC 2 X 10°, 
normalized collision energy 35, maximum injection time 100 ms) and the MS3 scan 
was analysed in the Orbitrap (HCD, 30k resolution, maximum AGC 1.5 X 10°, 
maximum injection time 250 ms, normalized collision energy 50). Multiple frag- 
ment ions from each MS2 spectrum were selected for MS3 analysis using isolation 
waveforms with multiple frequency notches". 

Data analysis. Statistical data analysis was performed with GraphPad Prism and 
Microsoft Excel, including Daniel’s XL Toolbox Add-In. Unless otherwise specified, 
analyses were by one-tailed Student’s t-test. Sequence alignment was performed 
with a modified Clustal W algorithm using Vector NTI AlignX. Mass spectrometry 
data were processed using an in-house software pipeline’. Raw files were con- 
verted to mzXML files and searched using the Sequest algorithm’ against a com- 
posite database containing sequences from the mouse uniprot database in forward 
and reverse orientations as well as the sequences of common contaminating proteins 
(for example trypsin). Database searching matched MS/MS spectra with fully tryptic 
peptides from this composite database with a 20 p.p.m. precursor ion and a product 
ion tolerance of 1 Da. Carbamidomethylation of cysteine residues (+57.02146 Da) 
and TMT tags on peptide amino termini and lysines (+229.162932 Da) were set as 
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static modifications. Variable modifications of oxidation of methionine residues 
(+15.99492 Da) and phosphorylation (+79.966330 Da) on serine, threonine and 
tyrosine residues were used. The data were filtered to a false discovery rate of less 
than 1% based on the target-decoy database approach at both the peptide and 
protein levels“. Linear discriminant analysis was performed to generate a classifier 
to distinguish between correct and incorrect MS2 spectra assignments based on the 
following parameters: XCorr, ACn, peptide ion mass accuracy, charge state and 
peptide length, as described”. Peptides were then assembled into proteins that 
were scored probabilistically and further filtered to a protein-level false discovery 
rate of about 1%. Peptide quantification using TMT reporter ion intensity was 
performed using in-house software, as described’. In brief, a 0.06m/z window 
around the theoretical m/z value of each reporter ion was scanned for ions, and the 
intensity of the signal nearest to the theoretical m/z value was recorded. The 
intensities of the reporter ions were adjusted to account for isotopic impurities 
in each TMT variant (as provided by the manufacturer). For comparisons, the 
whole data sets were filtered to a 1% false discovery rate, and proteins and phos- 
phosites were then quantified by summing reporter ion counts for all the peptide- 
spectral matches. Filtering was performed to remove poor-quality MS3 spectra in 
amanner similar to that described previously*’. Protein quantification values were 
exported for further analysis in Excel or Matlab. Hierarchical clustering was 
performed using Matlab. 

In vitro, MS-based kinase activity assay. PPARy (1 jig; Active Motif) was incu- 
bated with 50 ng of recombinant kinases MEK2, ERK2 (SignalChem) and Cdk5/ 
p35 (Millipore) and kinase reaction buffer containing 25 mM Tris-HCl pH7.5, 
5mM ATP, 7.5mM MgCh, 0.2 mM EGTA, 7.5 mM f-glycerophosphate, 0.1 mM 
Na3VO, and 0.1 mM dithiothreitol in a final reaction volume of 50 ul. All proteins 
corresponded to the human sequences. After incubation for 45 min at room tem- 
perature, the reaction mixture was subjected to both LysC and trypsin digestion 
(individual reactions). After purification, the samples were then analysed by LC- 
MS/MS in a similar manner to that described above but without the additional 
MS3 dimension. 
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Extended Data Figure 1 | Metabolic profiling of adipose-specific Cdk5-KO 
mice ona standard chow diet. a-e, Fasting plasma levels of glucose (a), insulin 
(b), total triacylglycerols (c), free fatty acids (FFA) (d) and total cholesterol 
(e) (n = 16 (control) and 17 (KO)). f, g, Body weights (f) and intraperitoneal 
glucose tolerance test (g). Mice were 12 weeks of age (n = 14 (control) and 
11 (KO)). No significant differences were observed. Error bars indicate s.e.m. 
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Extended Data Figure 2 | Energy homeostasis of adipose-specific Cdk5-KO — ambulatory locomotor activity (d), cumulative food intake (e) and body 
mice maintained on a high-fat diet. a—f, After a 48-h acclimatization period, _ weights (f) (n = 8 per group). Shaded areas signify the dark phase of the light 
singly housed mice were monitored for oxygen consumption (Voz) (a), cycle. No significant differences were observed. Error bars indicate s.e.m. 
carbon dioxide production (Vcoz) (b), respiratory exchange ratio (RER) (c), 
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Extended Data Figure 3 | Activity of alternative kinases in adipose tissue 

from Cdk5-KO mice. Brown adipose tissue protein lysates from mice 

maintained on a high-fat diet for 12 weeks. Blotting for phospho-p38, 


phospho-JNK and phospho-S473 and pT308 AKT was performed before 
loading for total protein amounts. 
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Extended Data Figure 4 | Conservation of the sites on MEK2 
phosphorylated by Cdk5. Mouse MEK2 T395/T397 corresponds to human 


MEK2 T394/T396. These sites share identity with MEK1 T386/T388 in both 
humans and mouse. Cdk5 has previously been shown to phosphorylate MEK1 


298 = MEK2 


at T286, a site not shared with MEK2. ERK has been shown to phosphorylate 
MEK1 1386 and contribute to regulation of kinase activity’. Homo, Homo 
sapiens; trog, Pan troglodytes; mus, Mus musculus; rat, Rattus norvegicus; 

bos, Bos taurus; canis, Canis lupus familiaris. 
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Extended Data Figure 5 | Body weight of control and of adipose-specific 
Cdk5-KO mice maintained on a high-fat diet after treatment with 
PD0325901. Treatment similar to that in Fig. 4a—c. The body weights are not 
significantly different by ANOVA. Error bars indicate s.e.m. 
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Extended Data Figure 6 | Effects of PD0325901 treatment on ob/ob mice. a-c, Glucose tolerance test (a), adiponectin levels (b) and body weights (c) of ob/ob 


mice treated with PD0325901 (n = 7 (vehicle) and 8 (PD)). *P = 0.05 by Student's t-test. Error bars indicate s.e.m. 
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Extended Data Figure 7 | Inflammatory markers in epididymal white 
adipose tissue from ob/ob mice treated with MEK inhibitors. Gene 
expression analysis was performed on M1 macrophage markers Nos2 and 
tumour necrosis factor-a (TNF-a); M2 macrophage markers Arg], Chil3, 1110, 


Itgax, Clecl0a/Mgl1 and Mgl2; chemotactic ligand Ccl2 and receptor Ccr2; and 
macrophage surface markers Emrl, Cd68 and Csflr (n = 7 or 8 mice per 
group as in Fig. 5f,h). Gene expression was analysed by ANOVA. Error bars 
indicate s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001. 
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Extended Data Figure 8 | Schematic model of PPARy regulation at $273. and increased phosphorylation of S273 PPARy. d, MEK inhibition markedly 
a, In the lean state, PPARy is not phosphorylated. b, In the obese state,S273 decreases S273 PPARy phosphorylation. e, PPARy ligands, including the 
phosphorylation is driven by both Cdk5 and ERK with CDK5 repressing MEK __ thiazolidinediones, block the accessibility of $273 PPARy by either ERK or 
and ERK activity. c, Cdk5-KO results in derepression of MEK and ERK kinases CDKS5 kinases. 
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Subnanometre-resolution electron cryomicroscopy 
structure of a heterodimeric ABC exporter 


JungMin Kim", Shenping Wu2*, Thomas M. Tomasiak**, Claudia Mergel’, Michael B. Winter!, Sebastian B. Stiller?, 
Yaneth Robles-Colmanares”, Robert M. Stroud'?, Robert Tampé?4, Charles S. Craik! & Yifan Cheng? 


ATP-binding cassette (ABC) transporters translocate substrates across 
cell membranes, using energy harnessed from ATP binding and hydro- 
lysis at their nucleotide-binding domains”. ABC exporters are pres- 
ent both in prokaryotes and eukaryotes, with examples implicated in 
multidrug resistance of pathogens and cancer cells, as well as in many 
human diseases**. TmrAB is a heterodimeric ABC exporter from the 
thermophilic Gram-negative eubacterium Thermus thermophilus; 
it is homologous to various multidrug transporters and contains 
one degenerate site with a non-catalytic residue next to the Walker 
B motif’. Here we report a subnanometre-resolution structure of 
detergent-solubilized TmrAB in a nucleotide-free, inward-facing con- 
formation by single-particle electron cryomicroscopy. The recon- 
structions clearly resolve characteristic features of ABC transporters, 
including helices in the transmembrane domain and nucleotide- 
binding domains. A cavity in the transmembrane domain is access- 
ible laterally from the cytoplasmic side of the membrane as well as 
from the cytoplasm, indicating that the transporter lies in an inward- 
facing open conformation. The two nucleotide-binding domains 
remain in contact via their carboxy-terminal helices. Furthermore, 
comparison between our structure and the crystal structures of other 
ABC transporters suggests a possible trajectory of conformational 
changes that involves a sliding and rotating motion between the two 
nucleotide-binding domains during the transition from the inward- 
facing to outward-facing conformations. 

ABC transporters use ATP binding and hydrolysis to drive substrate 
translocation across a membrane. Many members of the ABC exporter 
family have varying selectivity and transport substrates from inside to 
outside the cell, a property thought to allow them to facilitate export of 
xenobiotics such as drugs and toxins®’. TmrAB has similar features to 
multidrug transporters, including transport of Hoechst 33342 dye and 
competitive inhibition by verapamil, which suggests a common mech- 
anism for transport’. Itis composed of two homologous subunits, TmrA 
and TmrB, arranged with pseudo-two-fold symmetry with a combined 
molecular mass of ~135 kilodaltons (kDa). Each subunit has a six-helix 
transmembrane domain (TMD) anda cytoplasmic nucleotide-binding 
domain (NBD). TmrAB has two ATP binding sites, each formed between 
both NBDs. However, only one site is an active ATPase consensus site capa- 
ble of ATP hydrolysis. The degenerate (‘inactive’) site has a non-canonical 
aspartate residue next to the Walker B motif contributed by one NBD and 
non-canonical residues from the ABC signature motif of the other NBD*. 

Despite several crystal structures of ABC exporters representing var- 
ious states along the transport cycle, there are competing models ofhow 
these states are functionally connected in a physiological setting*”””. 
Single-particle electron cryomicroscopy (cryo-EM) holds the promise 
of offering structural information complementary to X-ray crystallog- 
raphy, especially for conformational states that may be difficult to access 
within the confines ofa crystal lattice. However, structure determination 
by single-particle cryo-EM is favoured by a relatively large molecular mass 


and higher symmetry. Although recent technological breakthroughs have 
enabled determination of the first atomic structures of homo-tetrameric 
ion channels of ~300 kDa (ref. 11), determining a high-resolution struc- 
ture of TmrAB still represents a major challenge, owing to its smaller size 
and pseudo-symmetric organization”. 

Here, we used fragment antigen-binding (Fab) domains to overcome 
these challenges. A Fab that forms a stable complex with TmrAB has 
a number of advantages for structure determination by single-particle 
cryo-EM”’. Furthermore, conformational-specific synthetic Fabs sta- 
bilize particles in a specific functional state’*. Following our established 
procedure for Fab selection", five Fabs were identified from a human naive 
B-cell Fab phage-displayed library using n-dodecyl-B-D-maltopyranoside 
(B-DDM)-solubilized TmrAB as the antigen. Fab binding was validated 
using a qualitative enzyme-linked immunosorbent assay (ELISA) screen’” 
(Fig. laand Extended Data Fig. 1a). The Fabs were further characterized 
by competitive ELISA analysis to establish whether they have overlap- 
ping or independent epitopes (see Methods and Fig. 1b). AD12, BA6, 
and AH11 (‘class A’) were found to have overlapping epitopes, inhibit- 
ing the binding of one another, whereas AH5 (‘class B’) and DHS (‘class 
C’) were found to have unique epitopes. Rigidity of the complexes was 
assessed by negative-stain electron microscopy two-dimensional class 
averages. All TmrAB-Fab complexes yielded two-dimensional class aver- 
ages that show characteristic features of Fabs (Fig. 1c and Extended Data 
Fig. 1d), suggesting that these Fabs form sufficiently rigid complexes 
with TmrAB. TmrAB complexes that clearly show two Fabs (Fig. 1c) 
confirm that those Fabs bind to distinct sites. Furthermore, ELISA and 
negative-stain electron microscopy demonstrated that AH5 and BA6 
display the highest relative affinities (Extended Data Fig. 1b-d). Thus, 
these Fabs are preferred candidates for structure determination of TmrAB 
by cryo-EM. To gain more insight into the Fab-binding properties, AH5 
and class A Fabs were used in an immunoblotting assay with denatured 
TmrAB. AH5, despite showing the highest affinity (Extended Data 
Fig. 1d), did not bind the denatured transporter, whereas all three class 
A Fabs, including BA6, bound to denatured TmrB (Fig. 1d). These data 
suggest that AH5 recognizes a three-dimensional epitope and that BA6 
recognizes a linear epitope(s) within the TmrB sequence. 

Detergent-solubilized TmrAB-AH5 complex was purified (Extended 
Data Fig. 2a). Frozen-hydrated TmrAB-AH5 particles were imaged using 
a direct electron-detection camera, K2 Summit, following newly imple- 
mented procedures'®. Two-dimensional class averages of the TmrAB- 
AH5 complex show landmark features of both TmrAB and Fab (Extended 
Data Fig. 2b-e). We determined a three-dimensional reconstruction of 
TmrAB-AHS toa resolution of 8.2 A using gold-standard Fourier shell 
correlation = 0.143 criterion (Extended Data Fig. 2f, g)’’. Local resolu- 
tion estimation'® suggests most regions of the density map have a ~6 A 
resolution (Extended Data Fig. 2h, i). The density map is of sufficient 
quality to resolve the secondary structure features clearly, including all 
helices in the TMDs and NBDs (Fig. 2 and Extended Data Fig. 3). Such 
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Figure 1 | TmrAB Fab characterization. a, Qualitative ELISA to assess 

Fab binding to TmrAB. All Fabs showed binding to TmrAB except BG12 in 
both experiments A and B, where independently prepared Fab samples were 
used. Expression levels were assessed by immunoblotting and normalized 
(Extended Data Fig. 1a). b, Representative competitive ELISA between 
AD12-myc and Fab-Flag against TmrAB. AD12-myc binding was not affected 
by the presence of AH5-Flag or DH5-Flag (top left). AH5-Flag and DH5-Flag 
maintained near-maximum binding (*) at all AD12—myc concentrations 
(top right). AD12-myc binding was almost abolished in the presence of 
AD12-Flag or BA6-Flag (middle left). AD12-Flag and BA6-Flag maintained 
near-maximum binding (*) at all AD12-myc concentrations (middle right). 


well-defined structural features suggest that TmrAB as visualized here 
adapts a defined conformation of an ABC exporter with well-ordered 
domain architecture. Furthermore, density corresponding to the detergent 
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Figure 2 | Three-dimensional reconstruction of TmrAB-AH5 at 
subnanometre resolution. a, Cryo-EM density map of the TmrAB-AH5 
complex filtered to a resolution of 8.2 A. The atomic model of TmrAB and an 
atomic structure of a Fab (Protein Data Bank accession number 1M71) are 
docked into the density map. The map shows two NBDs, a bi-lobed DDM 
micelle, which is separated by TM4 (marked by two dotted lines in the top 
view), and well-defined AH5 density. In the bottom view, two parallel solid 
lines and dotted lines indicate orientations of the front and back halves of the 
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AD12-myc binding was not significantly affected in the presence of 
AH11-Flag (bottom left). AH11—Flag binding decreased as AD12—myc 
concentrations increased shown (bottom right). Uninhibited binding of 
AD12-myc and Fab-Flag suggests independent binding between AD12 and 
AHS or DHS. Inhibited binding of either AD12—myc or Fab-Flag suggests 
overlapping epitopes between AD12 and BA6 or AH11. c, Representative 
negative-stain two-dimensional class averages of TmrAB-Fab complexes. 

d, Immunoblotting of TmrAB, using Fab-Flag. Class A Fabs recognized the 
denatured form of TmrB, and AH5 did not recognize the denatured form of 
either strand. BG12 was used as a non-binder control, which did not detect 
either strand significantly. 


micelle defines a distorted ellipsoidal structure around the TMDs. The 
characteristic dumb-bell-shaped Fab density is also clearly defined, vali- 
dating the correctness of the three-dimensional reconstruction. 


micelle respectively. The two halves are tilted by ~30° with respect to each 
other. b, The density map at a higher contour level shows clearly resolved 
transmembrane helices. c, A cross-section view through the TMDs shows 
well-resolved transmembrane helices labelled red and blue for TmrA and TmrB 
respectively. d, NBDs of TmrAB in two different views. C-terminal helices 
(*) of the two NBDs are in close proximity depicted both in the density map and 
the docked atomic model. 
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To confirm that the conformation of TmrAB is not influenced by 
AHS binding, we determined two other three-dimensional reconstruc- 
tions: TmrAB alone and in complex with BA6. The three-dimensional 
reconstruction of the TmrAB-BA6 complex was determined to a reso- 
lution of 9.4 A (Extended Data Fig. 4) using a phosphor scintillator- 
based complementary metal-oxide-semiconductor (CMOS) camera. At 
this resolution, the majority of the TmrAB helices are resolved, and the 
BA6 Fab density has the expected shape. The three-dimensional recon- 
struction of TmrAB alone was determined using the same CMOS camera 
to 10 A resolution (Extended Data Fig. 5). The densities of TmrAB in 
all reconstructions overlap with each other in a unique orientation as 
measured by local cross correlation (Extended Data Fig. 6), confirm- 
ing that TmrAB is in a native conformation that is not induced by Fab 
binding. These experiments also demonstrated that both Fab incorp- 
oration and utilization of the direct electron-detection camera improved 
accuracies of image alignment and subsequent resolution of the final 
reconstruction (Extended Data Table 1). 

A model of TmrAB was constructed based on homology modell- 
ing and molecular dynamics flexible fitting. The crystal structure of a 
heterodimeric ABC exporter from a thermophilic bacterium, Thermotoga 
maritima, TM287/288 (ref. 8), was used as the starting template. TM287 
shares 31.3% sequence identity with TmrB, whereas TM288 shares 37.8% 
sequence identity with the TmrA. After consideration of two-dimensional 
class averages (Fig. 1c), immunoblotting data (Fig. 1d) and three- 
dimensional reconstruction of TmrAB-BA6 (Extended Data Fig. 4), 
we concluded that BA6 binds to the tip of the NBD of TmrB, opposite 
the AH5 binding site. This allowed us to position unambiguously the 
homology model into the density map as well as to determine the hand- 
edness of the three-dimensional reconstruction. The model was fitted into 
the TmrAB-AH5 density map by molecular dynamics flexible fitting”, 
yielding the final model of TmrAB (Extended Data Fig. 7). In both the 
three-dimensional density map and the fitted model, a cavity is evident 
that is accessible to the cytoplasm, suggesting that TmrAB is in an inward- 
facing conformation (Fig. 3a). The model also confirms that AH5 and 
BA6 interact with the NBD of TmrA and TmrB respectively. Such Fab 
binding inhibits ATPase activity of TmrAB (Extended Data Fig. 8), thus 
locking TmrAB in the current conformation. 

In the three-dimensional reconstructions, the detergent micelle forms 
a torus in which two helices lie in the surface, resulting in an unusual 
bi-lobed micelle with two halves tilted from each other by about 30° 
(Fig. 2a). This separation is mediated in both monomers by TM4, which 
curves outwards with TM5 and protrudes closer to the head groups of 
a-DDM (Fig. 2c). The TMDs contain two subdivisions, each of which 
is composed of TM1, TM2, TM3 and TM6 of one monomer and TM4 
and TM5 of the other. The splitting of the micelle matches the subdi- 
visions of the TMDs in this conformation. 

A large cavity is located on one side of the TMDs, surrounded by 
TM1-3 and TM6 of TmrA and TM4-6 of TmrB (Fig. 3a). Significant 
density (visualized at 5a) was observed in the cavity bound to TM5 of 
TmrB (Fig. 3b and Extended Fig. 3a), analogous to the substrate binding 
sites identified for glutathione ATM1-type transporters”””’. This den- 
sity has sufficient size to accommodate molecules as large as a Hoechst 
33342 molecule, a known substrate for TmrAB’. While the identity of 
the molecule that contributed to this density is unknown, we speculate 
that it is attributable to an unknown molecule co-purified with TmrAB, 
molecules of DDM detergent or lipids, suggesting that the cavity con- 
tains the cytoplasmic substrate-binding site. Although the cavity is open 
to the cytosol, it also has a small, lateral V-shaped gap (formed by TM4 
and TM6 of TmrB) that provides an additional entry point from the 
inner leaflet of the surrounding membrane. However, this gap is con- 
siderably smaller than the opening found in a wide-open, V-shaped apo 
conformation, such as in the open apo state of MsbA”. 

The structure of TmrAB reported here is in an inward-facing state 
in which two NBDs are in contact with each other via their carboxy 
(C)-terminal helices. This conformation is similar to the substrate- 
bound nucleotide-free state of ATM1-type exporters” or a eukaryotic 
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Figure 3 | Atomic model of TmrAB showing the laterally open inward- 
facing conformation. a, Representation of internal volume and opening to the 
external surface of the transporter. TmrA is coloured salmon and TmrB is 
coloured cyan. b, Two different views (tilted around the axis perpendicular to 
the membrane plane) of the substrate-binding cavity in the TMDs. Density 
bound to helix H4 (Tyr 187) and H5 (His 246) of TmrB was observed at a 
threshold of 5c. It has the size to accommodate a Hoechst 33342 molecule, 
which is a known cargo molecule of TmrAB, but was not added during the 
protein purification. The cargo-like density is inside the cavity but near the 
inner leaflet of the membrane. This position suggests a possible substrate 
pathway. c, Ribbon diagram of the TmrAB NBDs in the nucleotide-free 

state. Predicted locations of two pairs of cysteine mutations are marked. 

d, Non-reducing SDS-polyacrylamide gel electrophoresis (SDS-PAGE) gel 
demonstrating disulphide cross-linking of both double cysteine mutants in the 
apo state, and showing a clear difference in cross-linking behaviour between 
nucleotide-free and -bound TmrAB. 


P-glycoprotein homologue”. We confirmed that the isolated TmrAB truly 
is nucleotide free (Extended Data Fig. 8). Therefore, in the nucleotide- 
free TmrAB structure, two NBDs remain in contact with each other via 
the C-terminal helices of both NBDs (Fig. 2d), burying a total surface 
area of ~980 A (see Methods) with a minimal distance between the Ca 
positions of the two C-terminal helices of ~5 A. Disulphide bond cross- 
linking of nucleotide-free TmrAB with two independent cysteine muta- 
tion pairs in their C-terminal helices (TmrA-L585C/TmrB-G564C, or 
TmrA-A591C/TmrB-A567C) validated this interaction (Fig. 3c, dand 
Extended Data Fig. 9). Consequentially, the two subdivisions of the TMDs 
are not widely separated. Indeed, no TmrAB particles were observed in 
a wide-open, V-shaped conformation as found in some crystal struc- 
tures of nucleotide-free ABC exporters where the two NBDs are sepa- 
rated by a distance of 20-40 A (refs 9, 22, 24, 25). 

Connecting conformational states of a given ABC transporter to its 
substrate transport cycle requires structures at all distinct functional 
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states, which are currently not available for most transporters. There- 
fore we were limited to comparing the structures from different mem- 
bers of the family. The well-resolved helices and distinct features of our 
reconstruction ensure that the structure of TmrAB reported here is 
of sufficient precision to compare on a secondary structural level with 
structures of other ABC exporters, the AMPPNP-bound intermediate 
inward-facing structure of TM287/288 (ref. 8) and the ADP-bound 
outward-facing structure of homodimeric Sav1866 (ref. 26), determined 
by crystallography. Such comparison suggests a trajectory of conforma- 
tional changes connecting these states (Supplementary Videos 1 and 2). 
In TmrAB, the Walker A motifs in the two NBDs face each other but do 
not face the ABC signature motifs of the opposite NBDs (Fig. 4a top). In 
AMPPNP-TM287/288, the NBDs are in a different dimer contact state 
with the Walker A and ABC signature motifs of the opposite NBDs 
facing each other (Fig. 4a middle). In the outward-facing Sav1866 struc- 
ture, the NBDs are in a further closed dimer state (Fig. 4a bottom) to 
sandwich two nucleotides between the NBDs as if in a hydrolytically 
competent-like state. This suggests that the NBDs may slide and rotate 
against each other as they bind, then sandwich ATP molecules between 
the domains during the transition from the inward- to outward-facing 
conformation in a mechanism similar to that of the ABC importers”’. 

The relative NBD positions are associated with reorganizations of the 
TMDs, especially TM4 and TM6. In TmrAB, TM4 and TM6 of chain B 
show the furthest separation of ~18 A (Fig. 4b top), which corresponds 


I 
Walker A 


Sav1866 


Figure 4 | Comparison of the inward-facing conformation of TmrAB 

with the intermediate inward-facing structure of TM287/288 and the 
outward-facing structure of Sav1866. a, Conformational changes in the 
NBDs viewed from the membrane towards the cytosol. The Walker A motifs 
(blue) do not face the ABC signature motifs (green) in the atomic model of 
TmrAB (top) while they do in the TM287/288 structure (middle) where a single 
AMPPNP molecule (shown as sticks) is bound predominantly in one NBD. 
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to the lateral cavity opening shown in the open volume calculation of 
Fig. 3 as well as the substrate position. The lateral cavity opening is pres- 
ent in the intermediate state structure of TM287/288 but has narrowed 
toa maximal separation of 10 A (Fig. 4b middle) between the hinges of 
TM4 and TM6. In the outward-open conformation of Sav1866, the 
cavity is completely occupied by TM4 and TM6, leaving an opening 
exposed to the extracellular surface (Fig. 4b bottom). The rearrange- 
ments in the transmembrane domain, specifically TM4 and TM6, work 
in concert to close the lateral membrane gate and open a perpendicular 
extracellular gate. These movements are coupled to sliding and rota- 
tion of the NBDs from an orientation with Walker A motifs in contact 
with each other to an orientation wherein the Walker A motifs contact 
the opposing ABC motif (Supplementary Videos 3 and 4). 

Here, we have used single-particle cryo-EM to determine a subna- 
nometre-resolution structure of detergent-solubilized TmrAB in the 
nucleotide-free state without influence from crystal packing. The result- 
ing structure revealed an asymmetric inward-facing state with NBDs in 
contact but not ina position for ATP hydrolysis. Compared with another 
membrane protein recently determined to 3.4 A resolution (TRPV1, 
~300 kDa and C4 symmetry)"’, TmrAB is a more challenging target 
because of its smaller size (~135 kDa) and its pseudo-symmetry. The 
structure reported here greatly extends the capability of single-particle 
cryo-EM. At this resolution all helices in TMDs and NBDs are unam- 
biguously resolved, affording the determination ofa reliable model based 


In the Sav1866 structure (bottom), the two motifs from the opposite domains 
come close to sandwich two ADP molecules (shown as sticks). b, Changes in 
lateral opening to the cavity. The lateral opening between TM4 and TM6 
shown by dotted lines (left) narrows down from the TmrAB model (top) to the 
TM287/288 structure (middle), and completely closes in the Sav1866 structure 
(bottom), resulting in an opening to the extracellular space. 
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on the well-established workflow of homology modelling and flexible 
docking’’. It was made possible by taking advantage of three novel tech- 
nologies: (1) Fab-assisted single-particle cryo-EM”’, (2) newly developed 
cryo-EM methodologies including a direct electron-detection camera 
and (3) maximum likelihood based classification and refinement”*. Each 
of these technological advancements improves accuracy of image align- 
ments and the resolutions of the final reconstructions (Extended Data 
Table 1). In theory, it is possible to achieve ~3 A resolution structure of 
proteins as small as ~ 100 kDa (ref. 12). In practice, there are still major 
technological limitations that prevent achieving this goal, even with recent 
advances. Nevertheless, we expect that the combined approaches used 
in this study will be a powerful strategy for capturing the molecular fea- 
tures of membrane proteins in discrete functional states that are diffi- 
cult to capture by crystallization. These structural snapshots will aid in 
the elucidation of membrane protein mechanisms while advancing the 
limits in structural determination by single-particle cryo-EM. 

While this paper was under review, a nucleotide-free structure of 
TM287/288 was reported”, which is almost identical to the singly 
AMPPNP-bound structure (r.m.s.d. of 0.636 A). It supports our conclu- 
sion that the NBDs remain in contact in the nucleotide-free state, but 
cross-linking behaviour (Fig. 3c) suggests a noticeable conformational 
change in TmrAB upon AMPPNP binding. Fully addressing such dif- 
ferences requires structures of the same protein at all functional states, 
which single-particle cryo-EM holds the promise to deliver. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Sample preparation. TmrAB was expressed and purified similarly as described 
previously’. Protein was extracted by using 1% B-DDM in a buffer containing 50 mM 
Tris pH 8.0, 300mM NaCl, and 1 mg ml’ iodoacetimide was added followed by 
purification on Ni?* resin in 50 mM Tris pH 8.0, 300 mM NaCl, 0.05% «-DDM and 
finished with size-exclusion chromatography on a Superdex S200 (GE Healthcare) 
with 20 mM HEPES pH 7.0, 150 mM NaCl 0.05% o-DDM. For ELISA, purified 
TmrAB was biotinylated, using EZ-Link NHS-Chromogenic-Biotin (Pierce) as 
described previously’. 

Phagemids were used for myc-tagged Fab (Fab-myc) expression without further 
modification’’. For constructs to express Flag-tagged Fab (Fab-Flag), the myc tag 
in the phagemid was replaced with the Flag tag by PCR site-directed mutagenesis 
with a primer encoding DYKDDDDK. The amplified PCR products were cloned 
in at BstEII and Notl sites in the phagemids. 

Fabs were expressed in Escherichia coli with IPTG induction. Phagemids encod- 
ing each Fab gene were transformed into BL21 (DE3) Gold (Stratagene). Transformed 
cells were grown in 2YT containing 100 pg ml‘ ampicillin and 2% glucose (2YT 
Amp Glu) at 250 r.p.m. and 30 °C overnight. The overnight culture was aliquoted 
into fresh 2YT Amp 0.1% Glu such that attenuance (D¢o0 nm) was around 0.05. The 
cultures were grown at 250 r.p.m. and 37 °C until the log phase where Dgo0 nm Was 
between 0.5 and 0.8 and induced with 1 mM IPTG. The induced cultures were grown 
at 200 r.p.m. and 20 °C overnight. 

E. coli-expressed Fabs were subject to periplasmic protein fractionation by osmotic 
shock. Cell pellets of the induced cultures were re-suspended in ice-cold TES buffer 
(0.2 M Tris pH8, 0.5 mM EDTA, 0.5 M sucrose), followed by an equal volume of ice- 
cold double-distilled water containing protease inhibitor cocktail (complete EDTA- 
free, Roche). The cell re-suspension was incubated on ice for 30 min with gentle 
swirling every 10 min and spun down at 13,000g for 15 min. The supernatant was 
collected for further purification. 

The periplasmic fraction was subject to affinity chromatography by a batch pro- 
cedure using Ni’*-NTA agarose resin (Qiagen) and a standard protocol recom- 
mended by the manufacturer. Purified Fabs were dialysed against PBS buffer. For 
complex formation, the Fab samples were further purified by size-exclusion chro- 
matography using Superdex 75 (Pharmacia Biotech). For cryo-EM, TmrAB and 
Fab were mixed ina 1:2 molar ratio respectively, incubated for 30 min at room tem- 
perature (~22 °C) and subject to size-exclusion chromatography, using Superdex 
200 (Pharmacia Biotech). All samples for cryo-EM were prepared and stored in 
20mM HEPES buffer, pH 7 containing 150 mM NaCl and 0.05% a-DDM. 

Fab selection. Our Fab selection procedure has been described previously’’. For a 
brief summary, the procedure included indirect immobilization of biotinylated 
TmrAB to streptavidin-coated magnetic beads, three rounds of selection of phage 
displayed Fabs that bind to TmrAB, and identification of binders by an ELISA screen, 
which used Fabs with unknown concentrations. In the Fab nomenclature, the first 
letters designated plates, and the second letters and the numbers designated well 
positions. 

Qualitative and competitive ELISA. Qualitative ELISA was done as described 
previously'’. TmrAB was coated in the wells of Maxisorp plates (Nunc) and Fab 
solutions at unknown concentrations were added to the wells for binding. Fab solu- 
tions were prepared by serial dilutions of unpurified Fabs to yield series of relative 
concentrations. 

Competitive ELISA was done in the same way as qualitative ELISA analysis using 
relative Fab concentrations’, with the following modifications. (1) Equal volumes 
of Fab-myc and Fab-Flag solutions were added for each binding reaction. Unpu- 
rified Fab-myc at varying concentrations was prepared by serial dilution. Unpurified 
undiluted Fab-Flag was kept at a constant concentration. (2) Duplicates for each 
binding reaction were set up. (3) Anti-c-myc-peroxidase (Roche) was added in one 
reaction, and anti-Flag M2-peroxidase (Sigma) was added in the other. In Fig. 1, 
binding reactions for each graph contain AD12-myc at varying concentrations and 
Fab-Flag at constant concentrations, and TmrAB immobilized on wells. AD12- 
myc concentration in each binding reaction was relative to undiluted supernatant 
of AD12-myc culture that was considered 1. Normalized ELISA signals = (ELISA 
signal of Fab-Flag in each binding reaction) /(ELISA signal of Fab-Flag when [AD12- 
myc] = 0); *Normalized maximum binding of Fab-Flag = 1. Bound Fabs were 
detected by anti-Flag M2-peroxidase. 

Analytical HPLC measurements. TmrAB ATPase activity and nucleotide stoichi- 
ometry measurements were performed with reversed-phase HPLC using an Agilent 
1100 system. For all measurements, TmrAB and nucleotide (ATP/ADP) standards 
underwent organic extraction in the same manner with 25:24:1 (v/v) phenol:chlor- 
oform:isoamyl alcohol. The aqueous layer was resolved using a Cg column (Vydac, 
218TP104) and gradient elution (solvent A: 100 mM KH>PO,, pH 6.0; solvent B: 
90% methanol) with absorbance measurements recorded at 254 nm. 

Cysteine cross-linking. TmrAB double cysteine mutants were created to test the 
proximity of the NBDs in the apo state. Two double-cysteine mutant constructs were 
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generated, TmrA_A591C/TmrB_A567C and TmrA_L595A/TmrB_G564C. A591C/ 
A567C was designed using the primers 5'-GGCCAAGGGCGGCTACTACGCC 
TGCTTGTACCGGCTCCAGTTCCAG-3’ and 5'-CAGGCCGGAGGCCTCTA 
CTGCGAGATGGACCGCCTGCAG-3’, and L565C/G564C using 5’-CCGCCT 
TGTACCGGTGCCAGTTCCAGGAGGC-3’ and 5'’-GAGAGCCTCCTTCAGG 
CCGGATGCCTCTACGCGGAGATGGACCGCCTG-3’. Note that TmrAB con- 
tains an exposed native cysteine residue (TmrA-416C) that could not be removed. 

The double cysteine mutants were purified without nucleotides following the 
similar procedure used for the wild-type TmrAB, with B-DDM (~1-1.05%) used 
for protein extraction and no iodoacetimide added. Cross-linking of TmrAB with 
double cysteine mutations was assessed in the presence or absence of 1 mM Cu”* 
phenanthroline, 0.7 mM AMPPNP or ATP/Na’ orthovanadate (5 mM) with or with- 
out 5 mM MgCl, as indicated in the label of Extended Data Fig. 9b. All nucleotides 
were heated to 68 °C, the permissive temperature for maximal ATP hydrolysis, for 
3.5 min followed by immediate cooling on ice then 1 mM copper phenanthroline 
(Sigma) -induced cross-linking for 30 min on ice. Cross-linking was quenched with 
125 mM EDTA ina non-reducing SDS loading buffer and immediately loaded onto 
SDS-PAGE for analysis. 

The expression level of L595C/G564C was much lower than that of A591C/ 
A567C. Nevertheless, L595C/G564C could be purified and was assessed for cross- 
linking with and without the oxidizing reagent Cu’* phenanthroline at a lower 
concentration (567 nM) because of lower yields. Cross-linking in both cases pro- 
ceeded to >99% as judged by SDS-PAGE. 

The A591C/A567C mutant activity was compared with the wild-type protein 
following cross-linking with copper phenanthroline followed by buffer exchange 
on Millipore spin concentrators. Samples were incubated with 73.28 nM TmrAB, 
2.09 mM MgCl and 1 mM copper phenanthroline for 30 min, followed by three 
~100-fold dilutions with the purification buffer (20 mM HEPES, 150mM NaCl, 
0.05% a-DDM). Samples were then incubated with ATP at a final concentration of 
70 nM transporter, 2mM MgCl, and 250 uM ATP (pH 7.0) at 60°C for 1h in 
preparation for analytical HPLC analysis. 

Negative-stain electron microscopy. Each Fab and TmrAB was mixed in a 2:1 
molar ratio in pH 7.4 PBS containing 0.05% B-DDM, incubated at room tempera- 
ture (~22 °C) for 5 min, and negatively stained using uranyl formate. Electron micro- 
scope grids of negatively stained samples were prepared as previously described”. 
Specifically, 2.5 ,tl samples of detergent-solubilized TmrAB alone or in complex 
with Fabs were applied to glow-discharged electron microscope Cu grids covered 
by a thin layer of continuous carbon film and were stained with 0.75% (w/v) uranyl 
formate. Negatively stained electron microscope grids were observed on a Tecnai 
T12 microscope (FEI) operated at 120 kV. Images were recorded at a nominal mag- 
nification of 52,000 using a 4K X 4K CCD (charge-coupled device) camera (UltraScan 
4000, Gatan), corresponding to a pixel size of 2.21 A on the specimen. All images 
were binned to the final pixel size of 4.42 A for further processing. Particles were 
selected manually and were subjected to six cycles of multi-reference alignment 
and K-means classification using SPIDER”’. Approximately 3,000 particles for the 
TmrAB-Fab complex were manually picked from each sample and used for two- 
dimensional analysis. Sixty classes were generated for TmrAB in complex with single 
Fabs except DH5, and all double Fabs. Fifty classes were generated for TmrAB in 
complex with DH5. 

Cryo-EM. Cryo-EM grids were prepared by a standard plunge freezing procedure 
as described". Specifically, Quantifoil grids (Quantifoil Micro Tools) were glow- 
discharged for 10s and a 2 1] sample was loaded onto the grid and plunge-frozen 
in liquid ethane cooled by liquid nitrogen using a Vitrobot Mark III (FEI). CryoEM 
grids of the TmrAB-AH5, -BA6 complexes and TmrAB alone were imaged using 
a Tecnai TF20 electron microscope equipped with a field emission source (FEI) and 
operated at 200 kV. Images were collected at a nominal magnification of 80,000 
using a TemCam-F816 8k X 8k CMOS camera (TVIPS) corresponding to a pixel 
size of 0.935 A on the specimen. All images were binned by a factor of 4 for further 
processing, resulting in a pixel size of 3.74 A. Images were recorded with a defocus 
in the range from — 2.0 to —3.5 jum. Defocus values were determined for each micro- 
graph using CTFFIND3 (ref. 32). 

For the AH5 complex, an additional data set was collected using a Tecnai TF30 
Polara equipped with a field emission source and operated at 300 kV. Images were 
recorded on a Gatan K2 Summit camera operated in super-resolution counting 
mode following the established dose fractionation data acquisition protocol’®. Spe- 
cifically, images from TF30 were recorded at a nominal magnification of x 20,000, 
corresponding to a calibrated super resolution pixel size of 0.98 Aonthe specimen. 
The dose rate on the camera was set to about eight counts (corresponding to ~9.9 
electrons) per physical pixel per second. The total exposure time was 16 s, leading 
to a total accumulated dose of 41 electrons per square angstrém on the specimen. 
Each image was fractionated into 32 subframes, each with an accumulation time of 
0.5 s per frame. All dose-fractionated cryo-EM images were recorded using a semi- 
automated acquisition program, UCSFImage4 (written by X. Li). Images were recorded 
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with a defocus in a range from —2.5 to —4.0 jm. All super-resolution counting 
images were binned by 2 X 2 for further processing, resulting in a pixel size of 1.96 A. 
All 32 motion-corrected subframes were averaged together to a single micrograph 
for subsequent processing. Defocus values were determined for each micrograph 
using CTFFIND3 (ref. 32). For the data set of the AH5 complex collected with TF20 
microscope, the particles were manually picked. For all other data sets, we used a 
semi-automated particle-picking procedure similar to that previously described’. 
For each data set, we manually selected approximately 2,000 particles and generated 
two-dimensional class averages using reference-free classification followed by multi- 
ple rounds of multi-reference alignment and classification. Unique two-dimensional 
class averages were used as templates for automated particle picking. All picked par- 
ticles were subject to visual inspection to remove all possible bad particles. Initial 
three-dimensional models were generated using the common lines method* and 
filtered to 60 A resolution as the initial reference for further refinement. Three- 
dimensional reconstructions were calculated and refined using RELION”*. The 
resolutions of final three-dimensional reconstructions were estimated using gold- 
standard Fourier shell correlation curve = 0.143 criterion’’”. The final map was sharp- 
ened by a resolution-dependent amplitude-scaling factor*? and implemented in 
xmipp”*. Local resolution was estimated using ResMap™. All cryo-EM image pro- 
cess parameters are listed in Extended Data Table 1, which also shows that adding 
Fab and using a direct electron-detection camera improved the alignment accuracy 
and resolution of three-dimensional reconstructions. Local correlations between 
maps were computed using the ‘vop localCorrelation’ function in UCSF Chimera 
(Extended Data Fig. 7). The difference in resolution may contribute to the low cor- 
relation at certain regions. 

Fitting of atomic structures. A homology model of TmrAB was generated using 
SWISS-MODEL Automated Mode” and the atomic structure of heterodimeric ABC 
exporter TM287/TM288 with one AMPPNP bound (Protein Data Bank accession 
number 3QF4) used as the template. The atomic structure of a Fab (Protein Data 
Bank accession number 1M71) was used to fit the Fab density of either AH5 or BAO. 
Atomic models of TmrAB and Fab were docked into the three-dimensional recon- 
structions using the ‘fit-in-map’ function in UCSF Chimera’*. Handedness was 
determined at this stage. The difference of fitting between the default map and its 
mirrored version is significant by visual inspection. The docked models were fur- 
ther refined by molecular dynamics flexible fitting’. The initial homology model 
shows an unstructured region between TM1 and TM2 of TmrA owing to a lack of 
corresponding sequence in the template. We analysed the sequence around the 
region using the Quick two-dimensional function in MPI Bioinformatics Toolkit”, 


and part of it was predicted to form a helix by multiple prediction methods. Therefore, 
we built a short segment of helix for that part using the ‘Build Structure’ function in 
UCSF Chimera. That helical segment fitted well in the density map. All molecular 
graphics images were produced using UCSF Chimera™. 

Structure analysis. Sequence identity between the TmrAB chains and the TM287/ 
288 model were determined with the clustalW webserver. Internal cavities con- 
nected to the external environment were visualized using the 3V algorithm”. This 
procedure uses a 1.5 A sphere to represent water and calculates a completely acces- 
sible volume. The protein surface environment is calculated with a 12 A sphere that 
cannot access inner cavities, and the two volumes are subtracted to provide only 
those buried areas that water can enter from the external environment. In this cal- 
culation, only one large interconnected cavity was identified in the interior of TmrAB, 
connected to the surface via a cleft between TM4 and TM6. Buried surface-area 
calculations were calculated in Pymol. 
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Extended Data Figure 1 | Binding characterization of Fabs against TmrAB. 
a, Expression levels of Fabs used in ELISA in Fig. 1a. Expression levels were 
assessed by immunoblotting with anti-c-myc antibody and normalized against 
the highest expression level such that undiluted AD12 equals a normalized 
relative concentration of 1. Binding was monitored by anti-c-myc antibody. 
b, ELISA with purified Fabs against TmrAB. Purified Flag-tagged Fabs 
(Fab-Flag) were used in binding reactions. Binding was monitored by anti-Flag 
M2-peroxidase. Experiments were repeated twice. c, Relative affinities of the 
Fabs. The ELISA signal data from Extended Data Fig. 1b were fitted to a 
bimolecular binding equation to produce binding curves and apparent 
dissociation constant (K,) values. AH5 showed the highest affinity, followed by 
BA6, AD12 and AH11. DH5 and BG12 did not show significant binding. 
Apparent Kg values do not yield meaningful biophysical properties because the 
concentration of TmrAB participating in the binding reaction is unknown and 


the transporter is not free in solution. However, comparison among the Fabs 
should be sufficient to determine their relative affinities. Unpurified DH5 
showed significant binding (Extended Data Fig. 1a), whereas purified DH5 
did not show significant binding; n.d., not determined. d, Negative-stain 
electron microscopy analysis of Fab +7TmrAB mixes. Representative two- 
dimensional class averages include complex images that show clear Fab 
densities and images that do not. The two typical Fab views, the dumb-bell- 
and doughnut-shaped views, are indicated by yellow and red arrows 
respectively. Fab images indicate that Fabs are rigid and form rigid complexes. 
Percentages of complex particles were assessed by fractions of the numbers 
of images that clearly show Fab densities (# complex particles) to the total 
numbers of images that were included (# total particles) in two-dimensional 
class averages. The percentages correspond to relative affinity ranking 
determined by ELISA (Extended Data Fig. 1c). 
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Extended Data Figure 2 | Purification of a-DDM-solubilized and single- 
particle cryo-EM of TmrAB-AH5 complex. a, Elution profiles of TmrAB 
alone and TmrAB-AH5 from Superdex 200 are shown in blue and red curves 
respectively, showing a clear shift of the elution peak of the TmrAB-AH5 
complex to a higher molecular mass position. The shifted peak corresponding 
to fractions A6-A8 contained TmrAB and AHS, confirmed by SDS-PAGE. 
Fractions B3-B5 correspond to unbound AH5 and the loading material was 
run for comparison. b, Raw micrograph of TmrAB-AH5 (~185 kDa) 
embedded in a thin layer of vitreous ice. c, Fourier power spectrum calculated 
from micrograph shown in a. d, Two-dimensional class averages of the 
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TmrAB-AH5 complex. Fab AHS is clearly visible in many class averages. 

e, Initial three-dimensional reconstruction calculated from two-dimensional 
class averages using the common lines method implemented in SPIDER. 

f, Fourier shell correlation curves of TmrAB-AH5 (red), TmrAB-BA6 (purple) 
and TmrAB alone (blue). g, Euler angle distribution of all particles used in the 
final reconstruction. h, Final three-dimensional reconstruction coloured 

with local resolution. i, Voxel histogram corresponding to local resolution. 
The majority of voxels are at ~6-7 A resolution. Estimation of local 
resolution that is too close to the Nyquist (3.9 A) may not be accurate. 
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Extended Data Figure 3 | Selected slice views of the three-dimensional helices of both TmrA and TmrB are labelled. The arrow points to the extra 
reconstruction of TmrAB-AH5. The views are oriented in parallel with the density in the cavity. b, Two NBDs are in contact with each other. c, The 
membrane plane. The numbers of slices are marked. a, All transmembrane C-terminal helices of TmrA and TmrB are in close contact. 
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Extended Data Figure 4 | Single-particle cryo-EM of TmrAB-BA6 complex. 
a, Raw micrograph of TmrAB-BA6 (~185 kDa) embedded in a thin layer of 
vitreous ice. Images were collected on a Tecnai TF20 microscope using a 
scintillator-based TVIPS 8k X 8k CMOS camera. b, Fourier power spectrum 
calculated from the micrograph shown in a. c, Two-dimensional class averages 
of the TmrAB-BA6 complex. Fab BA6 is clearly visible in many class 
averages. d, Initial three-dimensional reconstruction of TmrAB-BA6 
determined using the common lines method implemented in SPIDER. e, Two 
different views of the final three-dimensional reconstruction of TmrAB-BA6 


filtered to a resolution of 9.4 A. As in the three-dimensional reconstruction of 
TmrAB-AH5, the density of micelles is split into two halves and tilted with 
respect to each other. The orientation of micelle density is marked with a pair of 
black solid lines and the gap in the micelle density generated by the helix 

H4 from TmrB is marked with a pair of red dotted lines. f, Densities of TmrAB 
in the three-dimensional reconstructions of TmrAB-AHS (khaki) and 
TmrAB-BA6 (grey mesh) overlap. Fabs AH5 and BA6 are indicated with 
arrows. g, An enlarged view to show the interface between TmrAB and BA6, 
which has a linear epitope in the NBD of TmrB. 
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Extended Data Figure 5 | Single-particle cryo-EM of TmrAB alone without 
Fab. a, Raw micrograph of TmrAB alone (~135 kDa) embedded ina thin layer 
of vitreous ice. Images were collected on a Tecnai TF20 microscope using 
scintillator based TVIPS 8k X 8k CMOS camera. b, Fourier power spectrum 
calculated from micrograph shown in a. c, Two-dimensional class averages 
of TmrAB. d-f, Three different views of TmrAB three-dimensional 
reconstruction shown in different (low: grey; high: gold) contour levels. 


Model of TmrAB (in ribbon diagram) was docked into the density map. The 
orientation of micelle density is indicated with pairs of solid black lines in 

f and the gap in the micelle is indicated with a pair of red dotted lines. 

g, h, Densities of TmrAB in the three-dimensional reconstructions of TmrAB 
alone (transparent khaki) and in complex with AH5 (grey mesh) overlap 
each other. 
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Extended Data Figure 6 | Cross correlation between TmrAB-AH5, TmrAB-BA6 is coloured according to the value of local cross-correlation values 
TmrAB-BA6 and TmrAB. Left: density map of TmrAB-AHS is coloured of TmrAB-BA6 with TmrAB-AHS (upper), and with TmrAB (lower). Right: 
according to the value of local cross-correlation values of TmrAB-AH5 with — density map of TmrAB is coloured according to the local cross-correlation 
TmrAB-BA6 (upper), with TmrAB (lower). Middle: density map of value of TmrAB with TmrAB-AH5 (upper) and with TmrAB-BA6 (lower). 
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Extended Data Figure 7 | Atomic model of TmrAB. a,b, Two different views | TmrA is coloured in salmon, and TmrB is coloured in blue. Intracellular loop 4 
of the atomic model of TmrAB, generated by flexible fitting of the sequence is coloured in green. c, Two subunits are arranged with a pseudo-two-fold 
homology model of TmrAB into the density map of the TmrAB-AH5 complex. symmetry. 
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Extended Data Figure 8 | AH5 and BAG inhibit the ATPase activity of the 
TmrAB. a, ATP standard for b-e. b-e, ATP hydrolysis assay at 37 °C. 
Reactions were performed at 37 °C for 20 min with 6.25 1M of TmrAB, 250 uM 
ATP and 2mM MgCl, in the presence of 25 1M of AH5 (c), BA6 (d) or a 
negative control Fab, U33 (e). ATP hydrolysis by TmrAB was reduced in 
the presence of AH5 or BA6 compared with the equivalent reaction in the 
absence of Fabs (b). ATP hydrolysis was not affected by the presence of U33, 
which does not bind to TmrAB (e). f, ATP and ADP standards (250 1M each) 


for g. Two peaks were resolved corresponding to ATP and ADP (black and 
red curves respectively). g, ATP hydrolysis by TmrAB was performed with 
70nM of TmrAB, 250 1M ATP and 2mM MgCh at 60 °C for 30 min. 

h, Identification of the TmrAB nucleotide-binding state. ATP was not detected 
from the protein-extracted aqueous phase (red curve). ATP at an equivalent 
concentration (blue curve) is shown as a control to demonstrate sufficient 
sensitivity for nucleotide detection. 
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Extended Data Figure 9 | Cysteine cross-linking validating the interaction 
between the C-terminal helices of TmrAB in the nucleotide-free state. 

a, Three samples (marked with an asterisk in Fig. 3d) were visualized by 
negative-stain electron microscopy, showing that TmrAB with the double 
cysteine mutation has the native dimeric shape of TmrAB. TmrAB contains an 
exposed native cysteine residue (TmrA-C416) that could not be removed. 

It causes some inter-dimer cross-linking (marked by arrows) under the 


oxidative condition. Scale bar, 50 nm. b, Analytical HPLC demonstrating that 
purified TmrAB containing the A591C/A567C mutation is nucleotide free. 

c, ATP hydrolysis assay indicating that disulphide cross-linking inhibits 

the ATPase activity of TmrAB containing the double cysteine mutation. 
Assays were performed in triplicate for 1h at 60°C with 70 nM reduced or 
oxidized TmrAB, 250 1M ATP, and 2mM MgCl, before analysis by 
analytical HPLC. 
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Extended Data Table 1 | Summary of TmrAB structure determination by single-particle cryo-EM 


Protein sample TmrAB-AH5 TmrAB-AH5 TmrAB-BA6 TmrAB 
Electron microscope TF20 TF30 Polara TF20 TF20 
Pixel size (A) 3.74A 1.96A 3.74A 3.74A 
Number of micrograph 347 639 301 594 

# of particles picked 27,000 131,000 41,000 46,000 
Final # of particles 27,000 102,000 30,000 36,000 
Resolution (A) 10.6A 8.2A 9.44 10.0A 
AccuracyRotations* 6.579° 7.306° 7.094° a 
Accuracy Translation* 1.085 pixel 2.018 pixel 0.994 pixel 1.195 pixel 
AccuracyTranslation* 4.06A 3.96A 3.72A 4.47A 
B-factor N/A -1195A? -1415A? -1851A? 


We determined three three-dimensional reconstructions of TmrAB complexed with a Fab, AH5 or BA6, and a three-dimensional reconstruction of TmrAB alone, using two different microscopes, TF30 Polara and 
TF20. TF30 Polara was equipped with a K2 Summit camera, and TF20 with an 8k x 8k phosphor scintillator based CMOS camera. All three-dimensional reconstructions were determined using RELION. 
Parameters of accuracy (marked with an asterisk) were obtained from RELION. The rotational accuracy of refinement of TmrAB in complex with Fabs (~6.5-7.3°) is significantly better than the TmrAB alone (9.5°), 
suggesting that the Fabs improved the accuracy of angular refinement. 
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CAREERS 


RESPONSIBILITY Royal Society urges students 
and advisers to explore all job options p.403 


TRANSLATION Bridge-building between 
academia and the clinic go.nature.com/ielkkf 


NATUREJOBS For the latest career 
listings and advice www.naturejohs.com 


WORK-LIFE BALANCE 


Lab life with kids 


Balancing research with raising children takes scheduling skills and organization. 


BY KENDALL POWELL 


hile she was in the middle of 
harvesting plates of cells at bio- 
technology company Genentech 


in South San Francisco, California, molecular 
oncologist Ingrid Wertz received a phone call. 
It was her childcare provider, telling her that 
her then-9-year-old daughter had a concussion 
after smacking heads with another child. 
Wertz had been expecting to spend the next 
four hours processing cells as part of an experi- 
ment on cytokine signalling, but instead she 
found herself rushing to the childcare centre. 
Luckily, her work was not ruined — before she 
left, she managed to store it in a freezer so that 
she could pick up the experiment the next day. 
Wertz’s experience is familiar to early-career 


researchers who are the parents of young chil- 
dren. Juggling the responsibilities of laboratory 
and home is difficult at best, especially because 
childbearing and child-rearing years tend to 
collide with the formative stages of a young 
researcher's career. The pressure to produce at 
the bench combined with the duties of child- 
care can push stress levels to breaking point. 
To cope with the extra responsibilities of 
childcare, scientist parents must learn to par- 
tition their days and nights to accommodate 
work and family, and to structure experimental 
protocols so that they do not skimp on quality 
family time. For example, instead of scheduling 
12-hour data-collection time points at 8 a.m. 
and 8 p.m., which are prime breakfast and 
dinner hours, they can shift them to 10 a.m. 
and 10 p.m., stopping back in the lab after the 


children are in bed. Organizational skills are 
important, such as coordinating working days 
and nights with other people providing care. 
Family-friendly national or workplace policies, 
such as reduced teaching responsibilities and 
flexible hours, can ease the burden. But busy 
parents should also make time for themselves 
— whether that time is for a 20-minute jog or 
a date together. 


UNCERTAINTY EXPECTED 

Lab work is inherently erratic: experiments 
can take twice as long as planned or cells may 
not grow, shifting schedules by a day or more. 
But the addition of children to a scientist's life 
adds unpredictability to the already unpredict- 
able. Wertz learned the hard way that she has 
to plan for the unforeseen: her strategies > 
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> include scheduling in lengthy buffer zones 
for large or crucial experiments. 

She also builds in multiple back-up plans. In 
emergencies, such as when her son shoved an 
acorn up his nose at age three, the scientists in 
her group all lend a hand. But back-up plan- 
ning also involves “knowing the non-critical 
times when you can stop in the middle of an 
experiment,” she says. Wertz works out ahead 
of time how to wind things down quickly with- 
out ruining data. That way, if the experiment 
starts to run behind schedule, it can literally 
be put on ice until the next day: stable DNA or 
proteins can be safely tucked away in a freezer. 

When a dance recital or school art show 
waits at the other end of an experiment, Wertz’s 
success depends on time micromanagement. 
“Plan every step a day ahead of time, locate all 
the reagents and schedule incubations during 
times when you are going to be called away to 
meetings,” she counsels. 

If research has to interrupt family time, it 
is best to prepare the family in advance. Asa 
clinical researcher at the University of Texas 
Southwestern Medical Center in Dallas, Jane 
Wigginton’s work centres on testing interven- 
tions for patients suffering from trauma — who 
tend to come into the hospital late on weekend 
nights and during holiday celebrations. Once 
she was paged just before a Fourth of July fire- 
works display and had to leave her family. 

But she and her ex-husband made sure that 
all six kids knew that even if an event such as 
Thanksgiving dinner had to be postponed, 
it would still happen. Looking back, she sees 
that the disruptions provided a good lesson for 
family members. “They learned that it does not 
have to be the exact hour or day to celebrate a 
family moment?’ 


Jens Schuster, a molecular biologist at 
Uppsala University in Sweden, and his wife, 
who works as a nurse supervisor, have also 
learned the art of tiber-organization. Recently, 
their 7-year-old daughter was too ill to attend 
school, prompting a shift in schedules. Schus- 
ter’s wife stayed home with their daughter 
until her hospital shift started. By that point, 
their 17-year-old son was home from school 
and could watch his sister (and 12-year-old 
brother) for a couple of hours. Schuster swung 
home from the lab a bit early, picking up their 
3-year-old daughter from day care on his way. 

Research allows for flexible schedules, he 
says. “Cells don’t care if you come in at mid- 
night or at noon to take care of them.” But he 
constantly checks his and his wife’s schedules 
to sidestep disasters as much as possible. Each 
week, on the family’s shared Google calendar 
(an almost universal tool for researcher fami- 
lies), he reviews upcoming obligations, such 
as his work meetings, his wife's shifts and the 
children’s dentist appointments. 

He finds and resolves scheduling conflicts, 
then plans experiments for days that have at 
least four hours of uninterrupted time. By 
scheduling such blocks about every two weeks, 
he can stay on track with experiments such as 
coaxing stem cells into more-specialized types 
or analysing images on a sophisticated micro- 
scope, he says. “I wonder myself, quite often, 
how do I do all this?” he says. 


PRIORITY CLASH 

Researcher parents concede that work-life 
boundaries can blur when the demands 
of childcare and career advancement peak 
simultaneously. The average age at which US 
researchers gain tenure is 39, when women’s 


TENURE TRAVAIL 


The cost of motherhood 


It is a fact. Having children comes at a higher 
cost to a woman’s academic science career 
than to a man’s. That finding prompted 

all ten campuses of the University of 
California in 2003 to adopt family-friendly 
policy reforms, such as giving mothers two 
semesters and fathers one semester without 
teaching responsibilities and automatically 
extending the allowed time in which to gain 
tenure by one year after the birth of each 
child. 

The policies were brought in after a report 
(go.nature.com/pbhijyd) produced for the 
university by Mary Ann Mason and her 
colleagues at the University of California, 
Berkeley, showed that although every 
academic researcher is busy, women with 
children were working the most, devoting 
more than 100 hours to work, domestic 
and child-care responsibilities each week 


(compared with fewer than 80 hours per 
week for faculty members without children). 

Mason’s research showed that having 
children within five years of obtaining a PhD 
lowers the chances that women will enter 
a tenure-track position and earn tenure. 
Women who have children more than five 
years after finishing their PhD do as well as 
women without children, but there are far 
fewer of these “late babies”, says Mason, 
who is a lawyer and professor at Berkeley’s 
graduate school. 

For her book Mothers on the Fast Track, 
Mason interviewed successful mothers 
from many fields and found that the most 
important contributing factor to their 
SUCCESS Was a partner who felt that their 
spouse’s career was as important as their 
own. “My advice in the book,” she says, “is 
don’t marry a jerk.” K.P. 
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Grzegorz Wicher snaps a selfie with his family. 


fertility has declined sharply. Most early-career 
academic scientists who also want to become 
parents pace themselves by the biological 
clock and have children before they earn ten- 
ure. Those who juggle tenure committees and 
babies say that a shift in perspective has helped 
them to cope with the tension. 

Rebecca Richards-Kortum, a bioengineer 
at Rice University in Houston, Texas, found it 
stressful trying to figure out the optimal timing 
for her first child during her pre-tenure days at 
another institution. She realized after a couple 
of years that she would regret not having a fam- 
ily more than not making tenure. “That was a 
real clarifying moment,’ she says. “It helped me 
let go ofa lot of the stress.” She went on to earn 
tenure and become an award-winning scientist 
and mother of six. 

But generous institutional services also 
smoothed the way for her to juggle lab life and 
childcare obligations. Subsidized, on-site day 
care was key, especially when she was breast- 
feeding (see “Tenure travail’). Some of her best 
impromptu discussions with colleagues about 
grant proposals happened in the day-care cen- 
tre at pick-up times. 

Generous family policies have also helped 
Grzegorz Wicher, who is a senior postdoctoral 
researcher at Uppsala University. For each 
child, the Swedish government guarantees 
13 months of leave at 80% salary, with the time 
off work being split between parents as desired. 
Some academic departments chip in another 
10%. Even so, research careers can suffer from 
long publication gaps and breaks in momen- 
tum, says Wicher, who is also in the middle of 
starting up a cell-culture company. 


To maximize efficiency, he and his wife, 
who is also a senior postdoc at Uppsala, 
planned their research around their respec- 
tive portions of parental leave after the birth 
of their daughter, now aged 8 months. They 
each arranged to finish laboratory projects 
before taking leave, and planned to use nap 
times and evenings at home to work on data 
analysis, manuscripts and grant proposals. 

In their house, mornings at the breakfast 
table are sacrosanct family time; so are the 
hours after their 6-year-old daughter's school 
day and until the children’s bedtime. The cou- 
ple typically works side-by-side in their home 
office for three to four hours after that. “It is 
alittle bit sad, but better than not seeing each 
other at all,” Wicher says. 


WORKING LATE 

In the absence of parental-leave or child- 
care policies, scientist parents turn to other 
strategies to accommodate lab obligations 
and family time. Many with young children 
split up their days and nights, returning to 
the lab during the late evening and working 
remotely when possible. Anthony Barry, an 
associate research fellow at Pfizer Biothera- 
peutics in Andover, Massachusetts, takes his 
laptop home every evening. 

“T get incredibly frustrated ifI get home so 
late that ’'m not getting to see my kids,” says 
Barry, whose sons are aged 7 and 10. Divid- 
ing his duties into work that must be done at 
Pfizer versus what can be done from home 
helps him to complete 8-10-hour work- 
days without missing prime family time. 
“Although people may say it’s horrible to have 
to take work home with you, I’ve found that 
to be the most enabling,’ says Barry. 

Others see the evening hours as the perfect 
time to head back to the lab. Amy Pandya- 
Jones, a postdoctoral researcher in RNA 
biology at the University of California, Los 
Angeles, splits her days to get quality time 
with her 5-year-old and 2-year-old. She goes 
to the lab early in the morning and comes 
home in the early afternoon. About three 
nights per week, after her husband gets home 
from work at around 7 p.m., she returns to 
the lab, working for another four hours. 

She is careful to waste not a second, and 
estimates that she squeezes what would 
normally be a full 8-10-hour workload into 
about 6-8 hours. “You cannot underestimate 
the planning,” she says: she slots in time on 
the weekly calendar even for a trip to the 
supermarket. 

Parents who manage to carve out minutes 
for themselves and their partners relieve 
some of the stress. One practice that helped 
Wigginton to stay sane was stealing an hour 
or two for herself, sometimes for a manicure 
ora pedicure. “I needed just a moment away, 
with nobody sitting in my lap and no pager 
going off? 

Wicher and Pandya-Jones both reserve 


one night a week for dates with their 
spouses — even if it is just a dinner of tacos 
and beer. Wicher and his wife also take it 
in turns to go running on alternate days. 
“It helps to wash the brain,” he says. Jaelyn 
Eberle, a palaeontologist at the University of 
Colorado, Boulder, recently finished a series 
of exercise sessions that started at 5:15 a.m. in 
the mornings. “I realize that ifI don’t get some 
me time — exercise or pottery lessons — then 
I'm not as creative at work,’ she says. 

On Sunday evenings when the kids are 
asleep, Wertz and her husband serve them- 
selves ice cream and sit down to look over their 
family calendar, plan, organize and talk. “We 
make a fun time of it — we get different Ben 
and Jerry’s flavours and sample the new ones.” 

Some researchers hire house cleaners and 
sitters, ask neighbours to drive kids to activi- 
ties or order groceries online to allocate their 
limited hours at home to family time rather 
than chores. It pays to ask for help from 
friends, relatives and even employers — espe- 
cially for single-parent scientists who have 
less support at home. That could mean asking 
grandparents to babysit for a weekend so that 
the researcher can finish up a grant applica- 
tion or asking a boss for a month’s notice 
before scheduling a business trip. 

But it is not all about nappy duty, day 
care and drudge. Researchers see benefits 

for themselves and 


“Cells don’t care for their children 
ifyoucomeinat from their work. 
midnight or at When the children 
noontotakecare were older, Wig- 
of them.” ginton took one or 


two of them (and 
her mother) on conference trips to Paris or 
Hawaii. Wertz enjoys “watching the joy and 
fascination, through the eyes of a child, ofice 
melting and water pouring” as her kids play 
in her lab on weekend visits. 

Richards-Kortum believes that blending 
research and parenting strengthens both 
endeavours. Her experience as a mother has 
helped to shape her research agenda on life- 
saving technologies for premature newborns. 
And her work in Malawi influenced her deci- 
sion to adopt her two youngest daughters 
from Ethiopia. 

“You look at the world through very 
different glasses than before you were a par- 
ent,’ says Wigginton, a mother of six. “My 
children have greatly contributed to any suc- 
cess I've had and to my motivation and drive.” 

Both she and Richards-Kortum have evi- 
dence that the hours devoted to research 
did not leave their children feeling resentful 
towards their scientific research careers — all 
of their university-aged children are follow- 
ing in their mothers’ footsteps, studying 
engineering, bioengineering or medicine. = 


Kendall Powell is a freelance writer in 
Lafayette, Colorado. 
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CAREER PROGRESSION 
Consider all options 


Research institutions should provide 
broader career guidance to their PhD 
students, and students should proactively 
assess their skills and options, according to 
a report by the Royal Society, an influential 
group of scientists based in London. 
“Students must not be regarded as mere 
‘bench monkeys, but nor should they 
themselves be passive in seeking out what 
they need,’ wrote Athene Donald, who 
chaired the group that put together the 
report, in an accompanying opinion piece. 

The Royal Society is certainly not the 
first science organization to highlight 
the grim chances of newly minted 
graduates and postdocs finding faculty 
positions in scientific research and to call 
for universities to provide better career 
preparation (see Nature 516, 7-8; 2014). 

A report from the US National Academies 
in Washington DC, for example, says that 
postdoctoral positions, often seen as the 
default step after a PhD programme, do 
not always help researchers to advance 
their careers, and that research institutions 
should inform PhD students that other 
types of work experience may be more 
beneficial (see go.nature.com/cxli6t). 

Donald, a theoretical physicist at 
the University of Cambridge, says that 
the report, entitled Doctoral Students’ 
Career Expectations: Principles and 
Responsibilities, aims to raise awareness 
of viable career options among students 
and their supervisors, and to bolster 
efforts by university career-guidance 
offices. Improving career awareness 
may require students and schools to 
arrange mentorships beyond a trainee’s 
lab, department or institution. And PhD 
advisers should not imply that a future in 
academia is the only desirable career path 
(see go.nature.com/h9872d). 

Lack of information is a serious issue, 
but merely highlighting careers beyond 
academia may not do much to help people 
to find optimal positions, says Sally 
Hancock, a higher-education researcher 
at the University of York, UK, who studied 
science PhD students at Imperial College 
London. Those who had been exposed 
only to academic research were likely to 
be purists who saw their programme as “a 
zero-sum game in which the objective is 
to achieve an academic position”. Those 
who had already worked outside academia 
were more likely to view non-academic 
options favourably, be proactive about 
exploring other choices and feel less stigma 
about pursuing them. PhD programmes 
might serve students best by incorporating 
work outside the university, she says: 
“Experience is imperative? 
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THE LEFT HANDS OF LOVERS 


BY EMILY ECKART 


he deactivated robots were corralled 

| upright in cages. I navigated the rows 
under the red light, searching for the 

only face that mattered. She was in the back. 
Her shapely features stood out; the other 
robots had been designed ugly on purpose. 

“Rosalind,” I whispered, foolishly 
expecting a response. But there was 
only my breath and the soft whine of 
the lights: no more of her pleasant 
hum, reassuring as a heartbeat. She 
was fully deactivated, her memory 
chip wiped clean. 

Even in deactivation, her face held 
that mysterious semi-smile. I realized 
it was her default expression — not, 
as I had believed, a smile just for me. 


I met Rosalind the week after the 
terrorists fled, when the city still 
gleamed with blood and broken 
glass. The commander introduced 
the robots to our unit. They resem- 
bled both sexes and all races, but 
were uniformly ugly — eyes too close 
together, odd-shaped noses, warts. 

“You'll use these robots for inspect- 
ing buildings abandoned by the terrorists,’ he 
said. “Their exteroceptive sensors can detect 
the vapours of eight different explosives, as 
well as chemical weapons like nerve agents. 
That way we don’t risk human lives.” He 
assigned each soldier a robot. When he got 
to me, there was none left. 

He took me aside. “There’s one robot left 
from the previous generation. Not an ideal 
model, but they're too expensive to waste. 
We thought you could handle it” 

He led me into another room. The robot 
was wearing jeans and a V-neck. She had 
skin the colour of a coffee-flavoured soy 
drink. Her eyes were wide and green. 

“This is RL38501X. We ordered humanoid 
robots after studies showed soldiers worked 
best with machines that they related to. But 
this generation was too relatable. People got 
attached. Last year a man followed his robot 
inside a booby-trapped building and died” 

The robot smiled at me. Her kind expres- 
sion reminded me of my mother, who had 
been killed with the rest of my family in the 

bombing of 32. 


> NATURE.COM “Tt’s a machine,’ he 
Follow Futures: said. “Think of itas it? 
Y @NatureFutures As I directed the 
Ei go.nature.com/mtoodm © SmartCar to the edge 


Strong connection. 


of the city, RL38501X made conversation 
from her hermetically sealed compartment 
in the back. 

“Are you from this area?” she asked. 

“Nope. Transferred here to join the mili- 
tary back in 33” 

“Do you like it?” 

I shrugged. “Sometimes it’s lonely.” 


“That is sad. No one should feel lonely” 

I let her out at the checkpoint. It, I 
reminded myself. Its job was to investigate a 
suspected storage facility for Novichok agent 
powders. As I waited, I found myself hoping 
she would be safe. Be safe, RL... Who could 
remember that serial number? RL should 
signify a name, I decided. Rosalind. 


While other robots got blown up or con- 
taminated by toxic chemicals, Rosalind went 
months without incident. We were transferred 
closer to the battlefront. On our first anniver- 
sary, I drove her to the hill above the new city. 
We looked out over the scarred buildings, the 
roads strewn with litter and wreckage. In the 
morning light, if you forgot what it meant, the 
glittering debris almost looked beautiful. 

“Your hand is like mine,” Rosalind said. 
I had wondered when shed notice. My left 
hand was a prosthesis. I had the same thin 
line on my wrist as she, the same titanium 
joints beneath artificial skin. 

“Tlost my real hand. In an accident.” Flee- 
ing my besieged hometown, I had fallen in 
the road. My hand was crushed by an errant 
SmartCar. 

Rosalind furrowed her eyebrows. “What 
is a ‘real hand’? Is this not real?” 
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She touched my prosthesis. A static shock 
passed between us, sharp and bright, much 
more vivid than the faint sensations I was 
used to. 

“Ow!” I felt guilty as soon as her face 
assumed a hurt expression. I grabbed her 
right hand with my left. She smiled. Quietly, 
we watched the sun rise higher over the city. 


I felt nervous about the squat con- 
crete building Rosalind had to 
inspect that week. Its broken win- 
dows gaped like missing teeth in the 
grin of some delinquent. 

As I waited, I thought about where 
I'd take her that night. I had been 
checking her out of the robot stor- 
age room a couple of nights a week, 
using maintenance as an excuse. So 
far we had listened to a Mozart sym- 
phony and watched a Truffaut film. 
She hadn't yet seen the stars. 

I stared out the windshield, won- 
dering when she would return. Ithad 
been longer than usual. A call came 
in on my earpiece. 

“Your robot has been contami- 
nated. Return to base immediately.” 

“Contaminated? How?” 

“A new strain of bacteria engineered by 
the terrorists. Extremely antibiotic resistant.” 

“But I can't just leave —” 

“Return to base immediately. Your robot 
will be sent to BioLabs for analysis and dis- 
posed of safely. You'll be assigned a new one 
tomorrow.’ 


The disposal facility was five miles from 
base, but adrenaline made it feel close. I 
snuck into the basement and found her 
crammed in a cage like trash. I reached 
through the bars and took her dangling 
hand, drawing it towards me. Her silicone 
skin was perfect, poreless, cold. 

With my right hand, I unscrewed my left 
prosthesis and dropped it on the floor. I 
unscrewed her left hand and attached it to 
my stump. As I walked away, my breath came 
shorter. Did I feel an itching in my chest, a 
dizziness? Would they find my body pock- 
marked and ridden with buboes? I did not 
care. Without Rosalind, I had no one. m 


Emily Eckart studied music and English 
literature at Harvard University. Her fiction 
has appeared in Potomac Review, Literary 
Orphans and elsewhere. You can read more 
of her work at www.emilyeckart.com. 
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